Docker Model Runner Brings Local LLMs to Your Desktop
- #AI
- #LLMs
- #Docker
- Docker Model Runner is a new beta feature in Docker Desktop 4.40 for Apple silicon-powered Macs, enabling developers to run LLMs and other AI models locally.
- Running LLMs locally with Docker Model Runner keeps data on the developer's machine, reduces latency, and avoids the per-request cost of external cloud APIs.
- Docker Model Runner packages LLMs as Open Container Initiative (OCI) Artifacts, allowing integration into CI/CD pipelines using familiar Docker CLI commands.
- The feature uses llama.cpp, an open-source C++ inference library, to run LLMs efficiently even without a dedicated GPU, making it possible to serve models on older hardware; a request sketch follows this list.
- Docker is partnering with AI leaders such as Google, Hugging Face, and Qualcomm to expand the selection of high-quality, optimized models available for local use.
- Future plans include expanding support to more platforms, such as Windows with GPU acceleration, and enabling developers to publish custom models.
- Docker is integrating the Model Context Protocol (MCP), an open standard for connecting AI agents to data sources, tools, and application environments; a client sketch follows this list.
- The Docker MCP Catalog provides a centralized way to discover, run, and manage over 100 MCP servers from leading providers.
- The Docker MCP Toolkit offers enterprise-grade management and security features for MCP workflows, including registry access management and secrets handling.
- Docker aims to make AI integration seamless, allowing developers to build, test, and deploy AI applications with the same ease as traditional containerized workflows.
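
To make the local-inference point concrete, here is a minimal sketch of chatting with a locally pulled model through Model Runner's OpenAI-compatible API. It assumes host TCP access has been enabled on port 12434 and that a small model such as ai/smollm2 has already been fetched with `docker model pull`; the port, model name, and prompt are illustrative assumptions, not prescriptions.

```python
# A minimal sketch: calling a local model via Docker Model Runner's
# OpenAI-compatible endpoint. Assumes host TCP access is enabled
# (e.g. `docker desktop enable model-runner --tcp 12434`) and the model
# was pulled beforehand with `docker model pull ai/smollm2`.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:12434/engines/llama.cpp/v1",  # local llama.cpp engine
    api_key="not-needed",  # local inference requires no API key
)

response = client.chat.completions.create(
    model="ai/smollm2",
    messages=[{"role": "user", "content": "In one sentence, what is an OCI artifact?"}],
)
print(response.choices[0].message.content)
```

Because the endpoint speaks the OpenAI API, existing SDKs and tooling work unchanged: swapping between a cloud model and a local one is largely a matter of changing `base_url` and `model`.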
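Likewise, MCP servers from the Docker MCP Catalog are ordinary container images, so an agent can launch one over stdio and discover its tools. The sketch below uses the `mcp` Python SDK and an `mcp/time` image name that follows the catalog's naming convention; treat both the image name and the exact SDK calls as assumptions to verify against current documentation.

```python
# A sketch of connecting to a containerized MCP server over stdio using the
# `mcp` Python SDK (pip install mcp). The mcp/time image name is an assumption
# modeled on the Docker MCP Catalog's naming convention.
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Run the MCP server as a throwaway container; -i keeps stdin open for stdio transport.
server = StdioServerParameters(
    command="docker",
    args=["run", "-i", "--rm", "mcp/time"],
)

async def main() -> None:
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()          # MCP handshake
            tools = await session.list_tools()  # discover the server's tools
            for tool in tools.tools:
                print(f"{tool.name}: {tool.description}")

asyncio.run(main())
```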