LLAMA 3.1 8B, 70B and 405B
*All results are self-reported and not identifiably repeatable. Generally expected individual results will differ.
Scribd, Inc. is a multinational tech company focused on the written and spoken word. The company’s three products — Everand, Scribd and SlideShare — deliver knowledge, information and inspiration to over 200 million monthly unique visitors across the globe. The Everand library is home to millions of bestselling and emerging audiobooks, ebooks, podcasts and magazines.
Scribd, Inc. had developed an AI-powered content discovery assistant, Ask AI, to help its Everand digital library users overcome limited search functionality. But they wanted to up-level the solution to deliver personalized recommendations by combining knowledge of more than 195 million content pieces on Everand with nuanced understanding of each customer.
Llama enabled Scribd, Inc. to enhance Ask AI’s responses, optimize performance and manage infrastructure costs. As an open-source product, Llama was cost-effective and highly adaptable. Compared with other closed and open-source models, Llama delivered the essential component for success: high accuracy. Llama’s semantic understanding and unmatched accuracy provided the nuanced responses to transform Ask AI.
Thanks to Llama offering the most comprehensive deployment options of any model provider, the team quickly found an approach compatible with its existing systems: Amazon Web Services (AWS) and Databricks batch inference. Scribd, Inc. integrated Llama into its Ask AI assistant workflow without any major infrastructure changes.
The new Ask AI experience provides on-point responses to nuanced questions.
The Scribd, Inc. team used three Llama models to create the new Ask AI: Llama 3.1 405B, 8B and 70B. To push beyond closed model limitations and achieve deeper customization, the team used Llama 3.1 405B to create synthetic training data and fine-tuned Llama 3.1 8B. With the latter, Scribd, Inc. was able to deliver better results with minimal latency for real-time components of Ask AI while managing the model’s footprint and computing demands.
In the background, Llama 3.1 70B generates content metadata for Ask AI’s knowledge base to improve content discovery and answer accuracy.
Ask AI uses multiple Llama models to achieve performance.
The new Llama-powered Ask AI streamlines the content discovery process by generating intuitive recommendations fast — and delivering a better, more personalized customer experience. At the same time, Llama has optimized the application for Scribd, Inc.
• +76% faster throughput in tokens per second vs. the previous model
• 97.7% accurate Macro-F1 score on intent detection
• 33% compute cost savings with JSON output
Steve Neola Tarazi, Senior Director of Product, Generative AI, Scribd, Inc.
Stay up-to-date
Subscribe to our newsletter to keep up with the latest Llama updates, releases and more.