Cache-to-Cache (C2C): LLMs Communicate Directly Through KV-Cache Fusion
Records found: 5
Cache-to-Cache (C2C) lets LLMs exchange semantic information via KV-Cache fusion, boosting accuracy by about 3–10% over text-based pipelines and roughly halving latency.
A hands-on tutorial showing how lightweight Qwen2.5-0.5B-Instruct agents manage ingestion, quality, and infrastructure optimization in multi-agent data pipelines.
Neuphonic released NeuTTS Air, a 748M-parameter on-device TTS model that clones voices from ~3 seconds of reference audio and runs locally via GGUF quantizations for CPU-first real-time synthesis.
Tinker is a Python API that exposes low-level training primitives so you can run custom loops locally while the platform handles distributed execution; it focuses on LoRA adapters, portable weights, and managed GPU clusters.
A practical tutorial showing how to run a brain-inspired hierarchical reasoning agent locally with a free Hugging Face model, using planning, code-based solvers, critique, and synthesis.