The Llama lightweight (1B/3B) models enable developers to bring Llama’s capabilities to mobile and embedded devices.
Meta is collaborating with the following partners to provide guidance and foundational software to use the Llama lightweight models on their device hardware. Browse their offerings below and follow the provided links to obtain more detail.
Arm Kleidi technologies unlock unprecedented out-of-the-box performance for running LLMs everywhere from cloud to edge, enabling acceleration for Llama 3.2 through library integration into AI frameworks.
Developers can port Llama models to GenAI enabled MediaTek products using the MediaTek Neuropilot LLM toolkit. The toolkit supports up to 4-bit quantization, LoRA fine-tuning, advanced graph and cache optimizations, and accelerated decoding techniques that promise best-in-class inference efficiency without noticeable loss of accuracy.