Block Encoding Compression

10d

Google’s TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x

Google Research recently revealed TurboQuant, a compression algorithm that reduces the memory footprint of large language ...

18d

Nvidia says it can shrink LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory ...

Dolby sues Snap over video compression patent claims tied to AV1 and HEVC

In a complaint filed in the US District Court for the District of Delaware, Dolby accuses Snap of infringing four video compression patents through Snapchat's use ...

Synthetic Identity Fraud Projected to Cost $58.3 Billion as Deepfake Risks Rise

Financial institutions and global payment platforms struggle to verify customer identities as deepfake-driven fraud ...

Streaming Media

The State of Streaming Codecs 2026

Streaming codec adoption used to be an engineering abstraction governed by RD curves, BD-rate tables, and roadmap slides that ...

The mad dash to build the future of multimedia

The Verge is about technology and how it makes us feel. Founded in 2011, we offer our audience everything from breaking news ...

i-SCOOP

Claude Subconscious and the rise of memory first coding agents

Claude Subconscious adds a persistent background memory layer to Claude Code through Letta. Explore how it works, why agent ...

Amazon Spring Sale live blog 2026: Final hours to score top Amazon deals

It's the last few hours of Amazon's Spring Sale, and we're still live-tracking the best deals over 60% off on home, tech, and ...

i-SCOOP

Tokenmaxxing and AI efficiency, how to optimize for outcomes instead of raw token volume

Tokenmaxxing is pushing AI usage to the limit, but more tokens do not automatically mean better results. Learn how to ...

Amazon Spring Sale live blog 2026: Tracking the biggest price drops all weekend

We're live-tracking the best Amazon Spring Sale deals over 60% off on home, tech, and more, as the sale continues this ...

16d

Breaking the 100M Token Limit: EverMind's MSA Architecture Achieves Efficient End-to-End Long-Term Memory for LLMs

The research introduces a novel memory architecture called MSA (Memory Sparse Attention). Through a combination of the Memory Sparse Attention mechanism, Document-wise RoPE for extreme context ...
