Changelog

Witness Janus-AI's Innovation Journey in Multimodal AI

11 months ago

🎉 First Release: Janus - The first multimodal model based on DeepSeek-LLM-1.3b-base and SigLIP-L
✨ Features: Decoupled visual encoding for multimodal understanding and generation
📜 MIT License for full commercial use

11 months ago

🚀 Released JanusFlow - A unified multimodal understanding and generation model
📊 Surpassed LLaVA-v1.5 and Qwen-VL-Chat in benchmark tests
🖼️ Support for 384×384 image generation

10 months ago

🌐 Launched online demo for Janus
🔄 Added support for DeepSeek Janus and Qwen2-VL
💻 Inference runs entirely in the browser

8 months ago

🎯 Released Janus-Pro 1.3B & 7B models
🔥 Surpassed previous unified models in performance
🛠️ Enhanced framework flexibility and efficiency
🌟 Advanced decoupled visual encoding architecture