Introducing Engram: Innovative Memory for Sparse LLMs
DeepSeek introduces Engram, enhancing LLM efficiency with a conditional memory axis.
Explore AI observability layers to enhance LLM performance and reliability.
AutoCode teaches LLMs to author and verify contest-grade programming problems using a Validator–Generator–Checker(+Interactor) loop and dual verification, achieving near-judge consistency on held-out tasks.
Agentic AI and unified platforms are enabling faster, more personalized customer service at scale while requiring new infrastructure and careful human-AI balance.
Vibe coding lets LLMs generate pipeline code fast, but engineers must enforce idempotence, DAG discipline, and data-quality checks before production.
Discover how to use Mirascope to implement the Self-Refine technique with Large Language Models, enabling iterative improvement of AI-generated responses for enhanced accuracy.
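The Self-Refine technique can be sketched as a generate–critique–refine loop. The snippet below is a minimal illustration, not Mirascope's actual API: `generate`, `critique`, and `refine` are hypothetical stand-ins for LLM calls.

```python
# Minimal sketch of the Self-Refine loop (generic; not Mirascope's API).
# generate(), critique(), and refine() are hypothetical stand-ins for LLM calls.

def generate(prompt: str) -> str:
    # Stand-in for an initial LLM generation.
    return f"draft answer to: {prompt}"

def critique(response: str) -> str:
    # Stand-in for the model critiquing its own output;
    # returns an empty string when it has no further feedback.
    return "be more specific" if "draft" in response else ""

def refine(response: str, feedback: str) -> str:
    # Stand-in for revising the response using the critique.
    return response.replace("draft", "refined")

def self_refine(prompt: str, max_iters: int = 3) -> str:
    # Iterate critique -> refine until the critic is satisfied
    # or the iteration budget is exhausted.
    response = generate(prompt)
    for _ in range(max_iters):
        feedback = critique(response)
        if not feedback:
            break
        response = refine(response, feedback)
    return response
```

In practice each stand-in would be an LLM call, with the critique prompt asking the model to find concrete flaws in its previous answer.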
SynPref-40M introduces a large-scale preference dataset, enabling the Skywork-Reward-V2 family of models to achieve state-of-the-art results in human-AI alignment across multiple benchmarks.
OMEGA is a novel benchmark designed to probe the reasoning limits of large language models in mathematics, focusing on exploratory, compositional, and transformational generalization.
EPFL researchers have developed MEMOIR, a novel framework that enables continuous, reliable, and localized updates in large language models, outperforming existing methods in various benchmarks.
Internal Coherence Maximization (ICM) introduces a novel label-free, unsupervised training framework for large language models, achieving performance on par with human-supervised methods and enabling advanced capabilities without human feedback.
Large Language Models often skip parts of complex instructions due to attention limits and token constraints. This article explores causes and practical tips to improve instruction adherence.
New research from Microsoft and Salesforce shows that large language models experience a 39% performance drop when handling real multi-turn conversations with incomplete instructions, highlighting a key challenge in conversational AI.
RLV introduces a unified framework that integrates verification into value-free reinforcement learning for language models, significantly improving reasoning accuracy and computational efficiency on mathematical reasoning benchmarks.