Your AI VTuber co-host that actually pays attention. Watches your screen, hears your audio, reads your stream chat, and reacts with 84 animations. Custom VRM avatars. Full VTuber face rig. Fully offline capable. No API keys required.

Sign in to add this item to your wishlist, follow it, or mark it as ignored

Early Access Software

Get involved with this software as it develops.

Note: This Early Access software is not complete and may or may not change further. If you are not excited to use this software in its current state, then you should wait to see if it progresses further in development. Learn more

What the developers have to say:

Why Early Access?

“Vtuber Assistant is a passion project born from wanting a real AI companion that could both act as a desktop pet and double as a beginner-friendly VTuber rig. Early Access lets me involve the community early—your feedback on AI reactivity, tracking polish, pluggable integrations, animation layering, and overall usability will directly shape updates. This is the best way to build something that truly fits what streamers, VTubers, and casual users actually want.”

Approximately how long will this software be in Early Access?

“We plan for Early Access to last 6–12 months, depending on community feedback and feature pace. The goal is to reach a stable 1.0 with core polish, expanded reactivity, and robust pluggable support, but the timeline will flex based on what matters most to users.”

How is the full version planned to differ from the Early Access version?

“We plan to expand scope in these areas during Early Access and beyond:

-Deeper AI brain integration (more contextual synthesis, better multi-input understanding).
-Expanded reaction library (pre-made & user-contributed animations, expressions, VFX).
-More pluggable options (additional model support, advanced TTS/STT integrations).
-UI/UX polish (hotkey customization, preset saves, better onboarding for beginners).
-Performance & stability improvements across Windows, Linux, macOS, and Steam Deck.
-We won't make hard promises on exact timelines or features—development will evolve with your input.”

What is the current state of the Early Access version?

“The Early Access version is fully playable and feature-complete for core use cases:

-Drag-and-drop desktop overlay with resize/positioning (OBS-friendly).
-Toggle between user-controlled facial tracking (webcam, overlay mask, extra bones, sliders/toggles) and AI co-host mode.
-Live reactivity to system audio, visual screen capture, Twitch chat, and voice/text input.
Generative AI responses (local default + user-pluggable online), TTS voice output, layered animations under tracking.
-Custom personality prompts, safety guard, and modular pluggable system.
It's stable for daily use (streaming, gaming, casual hanging out), but will continue to receive polish, bug fixes, and new features based on feedback.”

Will this software be priced differently during and after Early Access?

“I want to be clear, I do not plan to increase the price after early access. If anything I have thoughts to lower it depending on how wide spread this goes. To be honest I am playing this by ear.”

How are you planning on involving the Community in your development process?

“Community input is central:

-Steam reviews, discussions, and feedback threads will guide priorities (e.g., which integrations or reactions matter most).
-In-app feedback forms and Discord (if created) for direct suggestions.
Regular devlogs/news updates on Steam showing what's being worked on and why.
-Beta branches for testing new features before wider release.
-Roadmap visibility: I'll share planned additions and adjust based on what users vote for or request most.”
Read more

Buy Vtuber Assistant

 
See all discussions

Report bugs and leave feedback for this software on the discussion boards

EARLY ACCESS LAUNCH!!! – Shape the Future!

Version 0.3.3 is live.

The app now has full UI localization in Japanese, Korean, and Indonesian alongside English. Every button, label, and menu switches in real time when you change your language in settings.

Avaturn is now built in. Create a realistic face-tracked avatar from a single selfie without leaving the app. It auto-converts to VRM format and loads immediately.

The Dual Avatar System is live. Spawn your own VRM character on screen alongside the AI. Your webcam drives your model in real time. The AI avatar keeps running independently. Both on screen at once.

Long-term memory is live. The AI now remembers things between sessions through per-profile memory files. Tell it your channel name and running jokes once and it remembers them forever.

Let's build something wild together

— Draven

About This Software

VTuber AI Assistant is a desktop AI companion that actually knows what you are doing. It watches your screen, listens to your system audio, reads your Twitch or YouTube Live chat, and responds through a fully animated VRM avatar with real personality. All of it can run locally if you want.

Not a desktop pet. Not a simple chatbot. A fully aware AI sidekick built for streamers, VTubers, and anyone who wants a desktop companion that genuinely pays attention.

It also works as a beginner-friendly VTuber rig, tracking your face through your webcam and driving an animated avatar in real time.

WHAT SHIPS WITH THE APP

-Everything works out of the box. No accounts or API keys required.

-Built-in AI brain

-A bundled AI model runs entirely on your machine. No internet, no setup, no keys. Just launch and start talking.

-Built-in voice

-Piper TTS speaks responses out loud, fully offline, with the avatar mouth animating as it talks.

-Built-in screen vision

-A local vision AI watches your screen using CPU. No GPU required. It sees what you are playing or watching and reacts with contextual commentary.

-84 animations

Happy, dance, sad, angry, think, scared, action, and more. The AI picks them automatically based on emotion, or you can trigger any of them manually at any time.

-Custom VRM support

-Import any VRM 0.0 or 1.0 model. Build one in VRoid Studio, grab one from VRoid Hub, create one from a selfie using the built-in Avaturn tool, or bring your own.

YOUR AI COMPANION

-Personality you control

-Set your companion's name, job, and personality traits. A pirate, a study buddy, a hype machine, whatever fits your vibe.

-AI Profiles

-Save multiple named character presets and switch between them instantly.

-Long-term memory

-Every profile has a persistent memory file. Write in your channel name, your regulars, running jokes, and what you are currently playing. The AI reads it at the start of every session so you never have to re-explain yourself.

-Live memory editor

-Update what the AI knows mid-session without restarting anything.

STREAM CO-HOST

-Twitch and YouTube Live

-Connects to your live chat in real time. No OAuth required for Twitch.

-Calls viewers by name

-The AI addresses chatters by their username when responding.

-Donation and sub callouts

-Twitch Bits, subs, resubs, gift subs, and raids. YouTube SuperChats and memberships. Every event triggers a dedicated animated reaction. Every donor gets called out every time, no exceptions.

-Response rate control

-Set how often the AI responds to chat so it never floods your stream.

FULL AWARENESS MODE

-Screen Vision

-Periodic screenshots are sent to a local vision AI. It reacts to what you are playing, watching, or browsing with contextual in-character commentary.

-Audio Reactions

-Listens to your music, game sounds, and video dialogue through your speakers. Reacts to beats, action sounds, and spoken content via local Whisper transcription.

-Voice Input

-Push-to-talk or always-on listening mode. Whisper runs locally. Works in the background with global hotkeys.

-Web Search

-Ask a question in natural language and the AI searches DuckDuckGo and summarizes the results in character.

-Big Brain Mode

When screen vision, audio reactions, voice input, and stream chat are all active at once, the AI combines all of it into compound reactions that reference multiple things simultaneously.

FACE TRACKING AND VTUBER RIG

-Use this app as your actual VTuber face rig. Your webcam drives the avatar in real time through MediaPipe. Head rotation, eye movement, blinks, and jaw are all tracked locally. No video ever leaves your machine.

-Dual Avatar System

-Spawn your own VRM character on screen alongside the AI avatar. Your face drives your model. The AI keeps running completely independently. Both avatars are on screen at the same time, fully animated.

-Per-axis smoothing, gaze offset, blink sensitivity, and upper body sway settings give you full control over how the tracking feels.

-Face Emotion Detection

-Your facial expressions automatically trigger body animations on your avatar. Smile and your character plays the happy animation. Angry brows trigger the angry reaction.

DESKTOP INTEGRATION

-Gaming Overlay Mode

-Sits on your desktop without a window frame. Survives fullscreen game focus stealing.

-Click-through mode

-The avatar stays visible but does not block any clicks to apps behind it.

-OBS integration

-Connects to OBS via WebSocket for scene-aware reactions.

-Global hotkeys

-Work even when the app is running in the background.

PRIVACY FIRST

-Every feature has a fully local option. Nothing is required to leave your machine.

-No telemetry. No analytics. No cloud dependency.

-All AI, screen vision, voice input, face tracking, and audio processing run on your device unless you choose cloud features.

-High-quality cloud options including Grok AI and ElevenLabs voice are available through an in-app credit system, but the app works completely without them.

BRING YOUR OWN AVATAR

-Import any VRM 0.0 or 1.0 model from VRoid Hub, Booth.pm, or VRoid Studio.

-Create a realistic avatar from a single selfie using the built-in Avaturn integration with automatic VRM conversion.

-Add custom animations by downloading free ones from Mixamo, dropping them in the animations folder, and restarting. Done.

COMING NEXT

Steam Workshop support for sharing and downloading community-made character profiles and personalities.

More avatar creation and customization tools.

Perfect for streamers who want a co-host that actually pays attention, VTubers who want their AI to feel alive, Begginer frinedly Vtuber rig, and anyone who thinks their desktop companion should know what game they're playing.

And me the Dev team!
-I got my own AI on my comp, helps me with discord, emails, bug reports, steam discussions. Me and my Me team will be there along side you to developing this project into something beautiful. 

Not a brainless pet. Fully yours. Way more fun.

AI Generated Content Disclosure

The developers describe how their game uses AI Generated Content like this:

Vtuber Assistant uses generative AI to create live text responses and voice output as a customizable co-host companion. It reacts to your voice/text input, screen audio/visuals, and Twitch chat with fun, contextual commentary.

All core functionality runs through a secure built-in proxy by default (Grok + ElevenLabs integration) — no external API keys or setup required to use the app. Free limited daily uses are included.

Optional premium features (unlimited generations, premium voices) are handled via Steam microtransactions (AI Credits packs). Local fallback models (bundled SmolLM2 + Piper TTS) provide offline functionality when needed.

Twitch chat integration is read-only — the AI consumes incoming messages and produces in-app responses only.

All outputs are moderated by a built-in Safety Guard to help prevent inappropriate content.

System Requirements

Windows
SteamOS + Linux
    Minimum:
    • OS: Windows 10 (64-bit) or later
    • Processor: Intel Core i5 6th Gen / AMD Ryzen 3 or equivalent
    • Memory: 8 GB RAM
    • Graphics: Integrated GPU with WebGL 2.0 support (Intel UHD 630 / AMD Radeon RX 550 or better)
    • Storage: 2 GB available space
    • Sound Card: Any
    • Additional Notes: Webcam (720p+) required for face tracking; Microphone for voice input. Works fully offline with bundled AI.
    Recommended:
    • OS: Windows 11 (64-bit)
    • Processor: Intel Core i7 9th Gen+ / AMD Ryzen 5 5600X+ or better
    • Memory: 16 GB RAM
    • Graphics: Dedicated GPU (NVIDIA GTX 1660 / AMD RX 5600 XT or equivalent)
    • DirectX: Version 12
    • Network: Broadband Internet connection
    • Storage: 6 GB available space
    • Sound Card: Any
    • Additional Notes: Dedicated GPU accelerates vision AI models. 1080p webcam recommended.
    Minimum:
    • OS: Ubuntu 20.04+ / Fedora 38+ / SteamOS 3+ or equivalent modern distro
    • Processor: Intel Core i5 6th Gen / AMD Ryzen 3 or equivalent
    • Memory: 8 GB RAM
    • Graphics: GPU with WebGL 2.0 support (Intel/AMD integrated or NVIDIA/AMD dedicated)
    • Storage: 2 GB available space
    • Sound Card: Any
    • Additional Notes: Webcam (720p+) and microphone required. Tested on Bazzite/Steam Deck. Flatpak or AppImage recommended for immutable distros.
    Recommended:
    • OS: Latest Ubuntu / Fedora / SteamOS
    • Processor: Intel Core i7 9th Gen+ / AMD Ryzen 5 5600X+
    • Memory: 16 GB RAM
    • Graphics: Dedicated GPU (NVIDIA/AMD with ROCm/CUDA support preferred)
    • Network: Broadband Internet connection
    • Storage: 6 GB available space
    • Sound Card: Any
    • Additional Notes: GPU acceleration for AI models; Steam Deck compatible in desktop mode.

Customer reviews for Vtuber Assistant About user reviews Your preferences

Overall Reviews:
1 user reviews (1 reviews)






To view reviews within a date range, please click and drag a selection on a graph above or click on a specific bar.




Filter reviews by the user's playtime when the review was written:



No minimum to No maximum

Show reviews in selected display order





Learn More
Filters
Excluding Off-topic Review Activity
Playtime:
Played Mostly on Steam Deck
Operating System:
CPU:
GPU:
Device Type: