IBM Granite 4.0
- #AI
- #Enterprise LLMs
- #IBM Granite
- Granite 4.0 is IBM's new family of efficient, enterprise-ready large language models (LLMs).
- The models target agentic workflows, delivering competitive performance at lower cost and latency.
- Granite 4.0 includes multiple model sizes (Micro, Tiny, Small) optimized for various hardware constraints.
- Benchmarks show Granite 4.0 models outperforming previous Granite generations despite their smaller parameter counts.
- The hybrid architecture interleaves Mamba-2 layers (roughly 90% of blocks) with conventional transformer blocks (the remaining 10%) for efficiency; see the layer-stacking sketch after this list.
- Granite 4.0 reduces RAM usage by over 70% for long-context and multi-session tasks compared to conventional LLMs.
- IBM has obtained ISO/IEC 42001 certification for Granite, covering safety, security, and transparency in its AI management system.
- Granite 4.0 models are supported on AMD Instinct MI300X GPUs, with Qualcomm Hexagon NPU support enabling on-device deployment.
- The models excel at instruction following, function calling, and retrieval-augmented generation (RAG); a tool-calling sketch appears after this list.
- IBM cryptographically signs model checkpoints and runs a bug bounty program with rewards of up to $100,000.
- Future releases include reasoning-focused 'Thinking' variants and additional model sizes (Nano, Medium).
- Granite 4.0 is available on platforms including Hugging Face, IBM watsonx.ai, NVIDIA NIM, and Docker Hub; a minimal loading example follows below.
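
To make the 90/10 split concrete, here is a minimal PyTorch sketch of how a decoder stack can interleave state-space blocks with occasional self-attention blocks in a 9:1 pattern. It is an illustration under stated assumptions, not Granite's implementation: `SimpleSSMBlock` is a toy gated linear recurrence standing in for Mamba-2, and the dimensions and layer counts are arbitrary.

```python
# Illustrative sketch only: ~90% SSM-style blocks interleaved with ~10% attention
# blocks, mirroring the ratio described for Granite 4.0's hybrid architecture.
# SimpleSSMBlock is a toy stand-in for Mamba-2, NOT IBM's implementation.
import torch
import torch.nn as nn


class SimpleSSMBlock(nn.Module):
    """Toy gated linear-recurrence block standing in for a Mamba-2 layer."""

    def __init__(self, d_model: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.in_proj = nn.Linear(d_model, 2 * d_model)
        self.decay = nn.Parameter(torch.full((d_model,), 0.9))  # per-channel state decay
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model). The recurrent state is O(d_model) and does
        # not grow with sequence length, which is what keeps long-context memory low.
        h = self.norm(x)
        u, gate = self.in_proj(h).chunk(2, dim=-1)
        state = torch.zeros(x.size(0), x.size(-1), device=x.device, dtype=x.dtype)
        outs = []
        for t in range(x.size(1)):
            state = self.decay * state + u[:, t]
            outs.append(state)
        y = torch.stack(outs, dim=1) * torch.sigmoid(gate)
        return x + self.out_proj(y)


class AttentionBlock(nn.Module):
    """Standard causal self-attention block (the ~10% of layers)."""

    def __init__(self, d_model: int, n_heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        # Boolean causal mask: True marks positions a token may not attend to.
        mask = torch.triu(
            torch.ones(x.size(1), x.size(1), dtype=torch.bool, device=x.device),
            diagonal=1,
        )
        y, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        return x + y


class HybridStack(nn.Module):
    """Repeats a pattern of 9 SSM-style blocks followed by 1 attention block."""

    def __init__(self, d_model: int = 256, groups: int = 4):
        super().__init__()
        layers = []
        for _ in range(groups):
            layers += [SimpleSSMBlock(d_model) for _ in range(9)]
            layers.append(AttentionBlock(d_model))
        self.layers = nn.ModuleList(layers)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            x = layer(x)
        return x


if __name__ == "__main__":
    model = HybridStack()
    x = torch.randn(2, 32, 256)   # (batch, seq_len, d_model)
    print(model(x).shape)         # torch.Size([2, 32, 256])
```

The point of the pattern is that the recurrent blocks carry a fixed-size state regardless of context length, which is what drives the memory savings claimed for long-context and multi-session workloads; only every tenth block pays the quadratic attention cost.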
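
For the Hugging Face availability noted in the list, a minimal sketch of loading and prompting an instruct checkpoint with the `transformers` library is shown below. The repository ID `ibm-granite/granite-4.0-micro` and the generation settings are assumptions; check the ibm-granite organization's model cards for exact names.

```python
# Minimal sketch: loading a Granite 4.0 checkpoint from Hugging Face and running
# a short chat completion. The repo ID is an assumption; verify it on Hugging Face.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ibm-granite/granite-4.0-micro"  # assumed repo ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # reduced-precision weights to fit on a single GPU
    device_map="auto",            # let accelerate place layers on available devices
)

messages = [
    {"role": "user", "content": "Summarize the key features of a hybrid Mamba-2/transformer LLM."},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

with torch.no_grad():
    output = model.generate(inputs, max_new_tokens=200)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0, inputs.shape[-1]:], skip_special_tokens=True))
```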
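
Function calling is usually exercised through the chat template's `tools` argument in `transformers`. The sketch below shows that general pattern with a hypothetical `get_weather` tool and the same assumed repository ID as above; the exact tool-call output format is defined by the model's chat template.

```python
# Sketch of tool/function calling via the transformers chat template. The
# get_weather schema is hypothetical and the repo ID is assumed, as above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ibm-granite/granite-4.0-micro"  # assumed repo ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Hypothetical tool schema; the model only sees this JSON description.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string", "description": "City name"}},
                "required": ["city"],
            },
        },
    }
]

messages = [{"role": "user", "content": "What's the weather in Zurich right now?"}]
inputs = tokenizer.apply_chat_template(
    messages,
    tools=tools,                  # the chat template renders the schemas into the prompt
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

output = model.generate(inputs, max_new_tokens=150)
# The model is expected to reply with a structured tool call (typically JSON with
# the function name and arguments) for the caller to parse and execute.
print(tokenizer.decode(output[0, inputs.shape[-1]:], skip_special_tokens=True))
```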