Build a Mini‑GPT with Tinygrad: Hands‑On Transformer Internals from Scratch
Step-by-step Tinygrad guide that builds tensors, attention, transformer blocks and a mini-GPT from scratch, with full code and experiments on training and kernel fusion.
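The attention layer at the heart of the mini-GPT described above is scaled dot-product attention. As a reference point before the tinygrad version, here is a minimal NumPy sketch of the same computation; the shapes and the `attention` helper name are illustrative assumptions, not code from the guide itself:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # scaled dot-product attention: softmax(Q K^T / sqrt(d)) V
    d = q.shape[-1]
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(d)
    return softmax(scores) @ v

q = np.random.randn(2, 4, 8)   # (batch, seq_len, head_dim)
out = attention(q, q, q)       # self-attention: Q = K = V
print(out.shape)               # (2, 4, 8)
```

Tinygrad's `Tensor` exposes the same building blocks (`matmul`, `softmax`, transposes), so the tinygrad implementation in the guide follows this structure line for line.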