VTuber setup with phone-as-webcam: budget face tracking that actually works (2026)

· ~10 min read

Disclosure: ChargeCast (Android-screen-to-OBS) is one of the tools mentioned in this guide and we make it. The face-tracking part of the setup is solved by other people's tools, which we'll credit honestly.

TL;DR

Your phone's front camera is almost always better than a $30–50 webcam for VTube Studio or VSeeFace face tracking — better optics, better low-light, and on iPhone, dedicated depth hardware (ARKit). DroidCam is the bridge that turns the phone into a webcam input. If you also play mobile games on stream, pair it with a screen-mirroring tool so the avatar and the gameplay land in OBS together.

Why face tracking quality depends on the camera

VTube Studio, VSeeFace, and other VTuber tools all do roughly the same thing: take a video frame, find facial landmarks (eyebrows, eye corners, mouth, jaw), and translate movement into avatar bones. The avatar's expressiveness — whether a small smirk reads as a smile or as nothing — is bounded by how cleanly the software can see your face.

That means three things matter:

  1. Sensor quality. A 720p budget webcam in dim lighting feeds a noisy frame. The tracker fights noise instead of tracking your face.
  2. Frame rate. A camera that delivers a steady 30fps is better than one that drops to 15fps under low light. VTube Studio interpolates, but interpolation can't recover what wasn't captured.
  3. Depth or stereo info (bonus). iPhone's TrueDepth gives ARKit real depth maps, which lets it pick up subtle blendshapes that 2D landmark detection misses.

Why phones beat budget webcams

The phone in your pocket has more camera engineering in it than the entire budget-webcam market combined.

$30 webcam$50 webcamMid-range AndroidiPhone (recent)
Sensor size1/4" or smaller1/4"–1/3"~1/3"1/2.7"–1/2"
Low-light handlingPoorMediocreGoodExcellent
Stable 30fpsSometimesYesYesYes (often 60fps)
Auto-focusFixed or slowSlowFastFast (with depth)
Depth dataNoneNoneSome flagshipsTrueDepth (ARKit)
Cost beyond what you own$30$50$0$0

The "$0" column is the killer. If you're a starting VTuber on a budget, the gear you already have outperforms the gear you'd add.

The basic setup: phone as webcam

The bridge between "my phone's camera" and "my VTuber software" is a webcam-emulation app. Two main choices:

DroidCam (Android + iPhone, paid Pro tier)

Detailed comparison here. The free tier caps at 480p, the Pro tier ($14.99 lifetime) unlocks 1080p and removes the watermark. Connects via USB or Wi-Fi. Reliable, well-supported, plenty of community guides for VTube Studio configs.

iPhone Continuity Camera (macOS only, free)

If you're on a Mac with a recent iPhone, macOS treats the iPhone as a system-level webcam with no third-party app needed. ARKit data isn't exposed to most VTuber software through this path, but the raw video quality is excellent. For the dedicated ARKit pipeline, look at face-tracking apps that run on the iPhone and stream blendshape data over the network (iFacialMocap, MeowFace).

Iriun Webcam, EpocCam, etc.

Several alternatives exist. They mostly differ in pricing model and platform support. DroidCam tends to be the recommended default in VTuber Discord servers for Windows users.

VTube Studio + DroidCam: the actual config

Once DroidCam is feeding a webcam input to your PC, VTube Studio sees it as a regular webcam. Setup is straightforward, but there are two settings that matter:

  1. Video resolution. 720p at 30fps is plenty for face tracking. Going to 1080p doesn't improve tracking — it just costs CPU on both ends.
  2. Camera input selection. In VTube Studio, the dropdown will list "DroidCam Source 3" (or similar). Pick that, not your built-in laptop webcam.

One gotcha: if you also have a built-in laptop webcam, OBS or VTube Studio may default to it on every launch. Save the camera selection in your VTube Studio profile so it doesn't switch back.

Where ChargeCast fits — for mobile-gaming VTubers

If your stream is "VTuber avatar reacting to me playing FFXIV" — i.e. PC games — you don't need ChargeCast. Your face → DroidCam → VTube Studio → OBS, and your gameplay is captured natively from the PC.

If your stream is "VTuber avatar reacting to me playing Genshin Impact / Honkai / a Pokémon game on Switch through Android emulator / a Korean mobile gacha" — i.e. mobile games — you need a way to get the phone's screen into OBS too. That's where ChargeCast comes in.

The full setup looks like:

Phone #1 (camera-pointed-at-you)
  ↓ DroidCam (USB or Wi-Fi)
  ↓
PC ← VTube Studio (webcam input → avatar)
       ↓
       OBS scene:
         Avatar layer (VTube Studio Spout/Syphon)
         Game layer (ChargeCast Window Capture)
         Mic + game audio (ChargeCast 3-channel mixer)

Phone #2 (running the mobile game)
  ↓ ChargeCast (USB)
  ↓
PC ← OBS Window Capture

Two phones, two cables. One is the camera, one is the game source. They don't conflict because they're different USB sessions claiming different things.

If you only have one phone and want to do this: use the phone as the camera, and a Switch / iPad / second device as the game source via a capture card. Or stream the same phone's screen with a tool like ChargeCast and use a separate USB or built-in webcam for your face. The "two phones" pattern is just the cleanest decoupling.

Honest trade-offs of the phone-as-webcam approach

Pros

Cons

Pure-mobile VTuber setups (no PC)

A growing pattern, especially among Japanese individual VTubers: stream entirely from the phone. Apps like REALITY, IRIAM, and 17LIVE include avatar rendering and face tracking on-device. No PC, no webcam, no OBS.

This is a different niche. The trade-off is platform lock-in (you stream to the platform, not to Twitch/YouTube), and customization is bounded by what the platform allows. If you want full control of the stream, the PC-based setup we described is the way. If you want zero setup and don't care about platform diversity, mobile-only is fine.

Quick decision tree

Streaming a mobile game as a VTuber?

ChargeCast handles the phone-screen-to-OBS half of the setup, with audio mixing built in. Free trial, Microsoft Store distribution.

▶ Try ChargeCast on Microsoft Store

or download the .exe directly