Changelog
Witness Janus-AI's Innovation Journey in Multimodal AI
11 months ago
- 🎉 First Release: Janus - The first multimodal model based on DeepSeek-LLM-1.3b-base and SigLIP-L
- ✨ Features: Decoupled visual encoding for multimodal understanding and generation
- 📜 MIT License, which permits commercial use
11 months ago
- 🚀 Released JanusFlow - A unified multimodal understanding and generation model
- 📊 Surpassed LLaVA-v1.5 and Qwen-VL-Chat in benchmark tests
- 🖼️ Support for 384×384 image generation
10 months ago
- 🌐 Launched online demo for Janus
- 🔄 Added support for DeepSeek Janus and Qwen2-VL
- 💻 Browser-based, inference-only experience
8 months ago
- 🎯 Released Janus-Pro 1.3B & 7B models
- 🔥 Surpasses previous unified models in performance
- 🛠️ Enhanced framework flexibility and efficiency
- 🌟 Advanced decoupled visual encoding architecture