Generate images from a text prompt right in your browser. Everything runs locally on your machine so nothing is sent to a server. Models small enough to run in a browser are still limited, so this is just a demo for now, expect it to improve over time.
Images are generated by Janus-Pro-1B, an open multimodal model from DeepSeek, run in the browser by Transformers.js 3.5.1 on ONNX Runtime Web with WebGPU.
It runs entirely on your machine — no server, no upload. The model (onnx-community/Janus-Pro-1B-ONNX, ~1–2 GB) downloads once from the Hugging Face Hub, then is cached by the browser.