MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%

🤖 Yapay Zeka 📰 VentureBeat 🕐 5 gün önce

Enabling LLMs to acquire new knowledge after training remains a major hurdle for enterprise AI — current solutions are either too expensive, too slow, or constrained by context window limits. MeMo , a framework from researchers at multiple universities, encodes new knowledge into a dedicated smaller memory model that operates separately from the main LLM. The modular architecture works with both open- and closed-source models and sidesteps the complexity of RAG pipelines and

Researchers have developed a new framework called MeMo that allows large language models (LLMs) to incorporate new information without the need for costly retraining or facing limitations of existing methods. MeMo works by encoding new knowledge into a separate, smaller memory model that functions alongside the primary LLM. This modular approach is compatible with various LLM types and bypasses the complexities and inefficiencies associated with techniques like retrieval-augmented generation (RAG) and full model fine-tuning. Experiments indicate that MeMo can effectively handle intricate queries, even when the data retrieval process is imperfect, and it avoids the issue of 'catastrophic forgetting' that plagues direct model updates.

This innovation offers a more efficient and cost-effective way to keep AI models updated with current information, addressing a significant challenge in enterprise AI deployment.

#llm#research

📌 Kaynak

Bu özet VentureBeat kaynağından otomatik derlenmiştir. Tamamı için orijinal habere gidin.

Orijinal haberi oku →

← Tüm haberlere dön