Gate News message, April 11, AI infrastructure company Ramp Labs released research findings called “Latent Briefing,” enabling efficient memory sharing among multi-agent systems by directly compressing large-model KV caches, greatly reducing Token consumption without losing accuracy. In mainstream multi-agent architectures, the orchestrator breaks down tasks and repeatedly calls worker model instances; as the inference chain grows longer, Token usage expands exponentially. The core idea behind Latent Briefing is to use the attention mechanism to identify the truly crucial parts of the context, discard redundant information directly at the representation layer, rather than relying on slow LLM summarization or RAG retrieval with less stable results. On the LongBench v2 benchmark, the method performed impressively: the worker model’s Token consumption dropped by 65%, the Token savings’ median for medium-length documents (32k to 100k) reached 49%, overall accuracy improved by about 3 percentage points versus the baseline, and the additional time spent per compression was only about 1.7 seconds—roughly a 20x speedup compared with the original algorithm. The experiments used Claude Sonnet 4 as the orchestrator and Qwen3-14B as the worker model, covering a wide range of document scenarios including academic papers, legal documents, novels, and government reports. The study also found that the optimal compression threshold varies with task difficulty and document length—hard problems are better suited to aggressive compression to filter speculative reasoning noise, while long documents are better suited to lighter compression to preserve dispersed key information.
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to
Disclaimer.
Related Articles
Nvidia Deploys OpenAI Codex AI Agent Across Entire Workforce on Blackwell Infrastructure
Gate News message, April 25 — Nvidia has rolled out OpenAI's Codex, an AI agent powered by GPT-5.5, to its entire workforce following a successful trial with approximately 10,000 employees, according to internal communications from CEO Jensen Huang and OpenAI CEO Sam Altman.
Codex is designed to as
GateNews15m ago
AI Coding Startup Cognition in Talks for $25B Valuation Funding Round
Gate News message, April 25 — AI coding startup Cognition is in early talks to raise hundreds of millions of dollars or more at approximately a $25 billion valuation, according to people familiar with the matter. Interest has increased following SpaceX's acquisition of a rival AI coding startup.
Co
GateNews35m ago
AI Trading Agent Platform Fere AI Raises $1.3M, Led by Ethereal Ventures
Gate News message, April 25 — AI-powered digital asset trading agent platform Fere AI announced the completion of a $1.3 million funding round, led by Ethereal Ventures, with Galaxy Vision Hill and Kosmos Ventures participating. The platform supports cross-chain networks including Ethereum,
GateNews1h ago
OpenClaw v2026.4.23 Adds gpt-image-2 Direct OAuth Support, Introduces Forked Context Mode for Sub-agents
Gate News message, April 25 — OpenClaw, an open-source AI agent framework, released v2026.4.23 on April 23, introducing updates across image generation, sub-agent mechanisms, and security hardening.
Image generation enhancements allow gpt-image-2 to be called directly via Codex OAuth without
GateNews2h ago
Fere AI Completes $1.3M Funding Round Led by Ethereal Ventures
Gate News message, April 24 — Fere AI, an AI-powered digital asset trading agent platform, announced the completion of a $1.3 million funding round led by Ethereal Ventures, with Galaxy Vision Hill and Kosmos Ventures participating.
The platform supports cross-chain networks including Ethereum,
GateNews13h ago