Gemma 3 270M: Tiny, Tunable, and Ultra-Efficient for Task-Specific Fine-Tuning
Gemma 3 270M is a compact 270M-parameter model from Google designed for energy-efficient, task-specific fine-tuning and on-device deployment, with INT4 quantization-aware training (QAT) support.
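Since the headline use case is task-specific fine-tuning, here is a minimal sketch of what that can look like with Hugging Face transformers and peft (LoRA). The model id google/gemma-3-270m, the LoRA settings, and the toy ticket-routing data are illustrative assumptions rather than an official recipe, and access to the weights may require accepting the Gemma license on Hugging Face.

```python
# Minimal sketch: task-specific LoRA fine-tuning of Gemma 3 270M with
# Hugging Face transformers + peft. Model id, hyperparameters, and the
# toy ticket-routing data below are illustrative assumptions.
from datasets import Dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

MODEL_ID = "google/gemma-3-270m"  # assumed Hugging Face model id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

# Attach small LoRA adapters so only a tiny fraction of weights is trained.
model = get_peft_model(
    model,
    LoraConfig(r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
               task_type="CAUSAL_LM"),
)

# Hypothetical task data: route short support tickets to a team label.
examples = [
    {"text": "Ticket: password reset not working -> Label: account"},
    {"text": "Ticket: charged twice this month -> Label: billing"},
]
train_ds = Dataset.from_list(examples).map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=128),
    remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="gemma-270m-router",
                           per_device_train_batch_size=2,
                           num_train_epochs=1,
                           learning_rate=2e-4,
                           logging_steps=1),
    train_dataset=train_ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("gemma-270m-router/adapter")  # saves adapter weights only, a few MB
```

For on-device deployment, the trained adapter can be merged back into the base weights (e.g. peft's merge_and_unload) before quantization; how that interacts with the INT4 QAT checkpoints mentioned above depends on the artifacts Google releases.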
Records found: 9
Anthropic proposes a novel method using persona vectors to detect and control personality shifts in large language models, enhancing their reliability and safety.
ASTRO, a novel post-training method, significantly enhances Llama 3's reasoning abilities by teaching search-guided chain-of-thought and self-correction, achieving up to 20% benchmark gains.
Unbabel introduces TOWER+, a unified multilingual large language model that excels at both high-fidelity translation and instruction-following, surpassing existing open-weight models on benchmarks.
New research demonstrates that inference-time prompting can effectively approximate fine-tuned transformer models, offering a resource-efficient approach to NLP tasks without retraining.
New research reveals how integrating in-context learning insights into fine-tuning datasets significantly improves language model generalization on complex reasoning tasks.
The FalseReject dataset helps language models overcome excessive caution by training them to respond appropriately to sensitive yet harmless prompts, enhancing AI usefulness and safety.
Salesforce’s xGen-small offers a compact AI model delivering efficient long-context understanding with reduced costs and strong privacy, transforming enterprise AI workflows.
OpenAI launches Reinforcement Fine-Tuning on the o4-mini model, enabling developers to customize AI reasoning with precision using reinforcement learning techniques.