Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

🤖 Yapay Zeka 📰 VentureBeat 🕐 5 gün önce

Test-time scaling (TTS) has emerged as a proven method to improve the performance of large language models in real-world applications by giving them extra compute cycles at inference time. However, TTS strategies have historically been handcrafted, relying heavily on human intuition to dictate the rules of the model’s reasoning. To address this bottleneck, researchers from Meta, Google, and several universities have introduced AutoTTS , a framework that automatically discover

Researchers have developed a new framework called AutoTTS that automates the design of test-time scaling (TTS) strategies for large language models. Previously, these strategies, which optimize model performance by allocating extra computational resources during inference, were handcrafted by humans. AutoTTS reframes this process as an algorithmic search, enabling dynamic optimization of compute allocation without manual intervention. This automated approach has demonstrated significant efficiency gains, reducing token usage by up to 69.5% in trials without compromising accuracy.

This development matters because it promises to significantly lower the operational costs and improve the efficiency of deploying advanced reasoning models in production environments.

#large language model#llm#research#app

📌 Kaynak

Bu özet VentureBeat kaynağından otomatik derlenmiştir. Tamamı için orijinal habere gidin.

Orijinal haberi oku →

← Tüm haberlere dön