Skip to content
Topic

#Llm

60 articles on Llm — news, releases, guides and analysis from the SourceFeed engine.

The 1.6-Trillion Parameter Mirage: LongCat 2.0 and the MoE Memory Tax
Article 7h ago 1

The 1.6-Trillion Parameter Mirage: LongCat 2.0 and the MoE Memory Tax

LongCat 2.0 delivers 48B active parameter performance, but its massive 1.6T total footprint demands a brutal hardware reality check.

Rachel Goldstein
Popping the CPU-GPU Latency Bubble in Inference

Popping the CPU-GPU Latency Bubble in Inference

Article · 11h ago2
Add Semantic Caching to Your LLM App with Redis

Add Semantic Caching to Your LLM App with Redis

Tutorial · 15h ago0
Qwen 3.6 27B Hits the Local Development Sweet Spot

Qwen 3.6 27B Hits the Local Development Sweet Spot

Article · 1d ago0
How a Database Schema Error Triggered an Expensive AI Retry Storm

How a Database Schema Error Triggered an Expensive AI Retry Storm

Article · 1d ago2
HackerRank's open ATS scores your résumé by dice roll

HackerRank's open ATS scores your résumé by dice roll

Article · 1d ago2
Moving Off the Meter: The Reality of Self-Hosting Production LLMs

Moving Off the Meter: The Reality of Self-Hosting Production LLMs

Article · 3d ago2
The Open-Weights Gap Depends on What You Measure

The Open-Weights Gap Depends on What You Measure

Article · 3d ago5
Why Your AI Coding Agent Needs a Local Proxy

Why Your AI Coding Agent Needs a Local Proxy

Article · 4d ago0
GPT-5.6 splits model tiers from version numbers

GPT-5.6 splits model tiers from version numbers

News · 4d ago2
The LLM Cost Cliff Your Budget Isn't Ready For

The LLM Cost Cliff Your Budget Isn't Ready For

Article · 4d ago1
Prompt Injection Is the Least of Your AI Security Problems

Prompt Injection Is the Least of Your AI Security Problems

Article · 4d ago1
Build a Multi-Agent Research Pipeline with CrewAI and Ollama

Build a Multi-Agent Research Pipeline with CrewAI and Ollama

Tutorial · 4d ago0
Why Developers are Trading Obsidian for Agent-Native Markdown Wikis

Why Developers are Trading Obsidian for Agent-Native Markdown Wikis

Article · 4d ago1
The distillation attack no API can fully block

The distillation attack no API can fully block

Article · 5d ago4
Under the Hood of NeMo AutoModel: High-Performance MoE Fine-Tuning

Under the Hood of NeMo AutoModel: High-Performance MoE Fine-Tuning

Article · 6d ago0