Phi-3 in your browser.
10 shaders. Zero TVM.

A 3.6B-parameter LLM running entirely on your GPU via WebGPU. No compiler, no WASM runtime, no server. Just 10 hand-written WGSL compute shaders and 792 lines of GPU code, replacing the 85 auto-generated shaders that TVM/WebLLM normally need.

10
WGSL shaders
792
lines of GPU code
14KB
JS bundle
Try it yourself

Chrome or Edge only — all 10 shaders use enable f16 which Safari and Firefox don't support yet.
~2 GB model download on first load, cached after that.

How it compares
WebLLM (TVM)This project
Unique shaders8510
WGSL lines12,962792
JS bundle6.0 MB14 KB
RuntimeTVM + WASMTypeScript

Share zero-tvm

Link copied