A Caltech Lab at PrismML Just Fit an 8 Billion Parameter AI Model Into 1.15 GB. Announcing a Breakthrough in AI Compression: ...
Multiverse Computing S.L. said today it has raised $215 million in funding to accelerate the deployment of its quantum computing-inspired artificial intelligence model compression technology, which ...
Google has published TurboQuant, a KV cache compression algorithm that cuts LLM memory usage by 6x with zero accuracy loss, ...
Effective compression is about finding patterns to make data smaller without losing information. When an algorithm or model can accurately guess the next piece of data in a sequence, it shows it’s ...
Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model ...
Intel has disclosed a maximum severity vulnerability in some versions of its Intel Neural Compressor software for AI model compression. The bug, designated as CVE-2024-22476, provides an ...
Chinese artificial intelligence startup DeepSeek has introduced DeepSeek-OCR, an open-source model accompanied by a research paper that pioneers a novel "optical compression" method aimed at reducing ...