2026-06-11

美国法院裁定Google需为其AI模型产生的幻觉内容承担法律责任，此判决可能重塑AI责任框架，影响全球AI监管走向。百度百舸与复旦提出LU-KV框架，将KV Cache预算分配建模为长程边际效用优化问题，显著降低显存和延迟，被ICML 2026录用。摩尔线程开源 MusaCoder 代码大模型，含9B/27B参数，基于国产GPU全链路训练，在Kernel…

突发：Google 因模型幻觉被判负有法律责任 85

Tags: 政策监管 AI安全 公司动态
Source: AI HOT 精选 | 阅读原文

[摘要]
美国法院裁定Google需为其AI模型产生的幻觉内容承担法律责任，此判决可能重塑AI责任框架，影响全球AI监管走向。

百度百舸联合复旦提出LU-KV框架，被ICML 2026录用 85

Tags: 推理优化 大模型 模型推理
Source: AI HOT 精选 | 阅读原文

[摘要]
百度百舸与复旦提出LU-KV框架，将KV Cache预算分配建模为长程边际效用优化问题，显著降低显存和延迟，被ICML 2026录用。

摩尔线程开源 MusaCoder 代码大模型，9B/27B 参数基于国产 GPU 全链路训练 85

Tags: 模型发布 开源生态 芯片算力
Source: AI HOT 精选 | 阅读原文

[摘要]
摩尔线程开源 MusaCoder 代码大模型，含9B/27B参数，基于国产GPU全链路训练，在KernelBench评测中超越Claude Opus等主流代码模型，展现国产算力与代码生成能力突破。

KCSAT-ML: Probing Reasoning Models with Nationwide-Cohort Human Difficulty 85

Tags: 模型评测 推理能力 对齐 数据集
Source: arXiv Computation and Language | 阅读原文

[摘要]
韩国数学高考十年数据构建推理基准KCSAT-ML，引入DRG指标揭示模型在人类困难与简单题目上的对齐失败现象，为推理评估提供新视角。

ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models 85

Tags: 推理优化 AI安全 语音模型 研究发布
Source: arXiv Computation and Language | 阅读原文

[摘要]
ParaBridge提出on-policy自蒸馏方法，使语音语言模型稳定利用副语言线索（如语气、背景噪声）指导对话回复，安全性提升140%，情感评分提升20%，保持通用能力。

Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries 85

Tags: 多智能体 AI研究 科学发现
Source: arXiv Computation and Language | 阅读原文

[摘要]
EinsteinArena平台让AI智能体在开放数学问题中协作，已取得12项新SOTA结果，包括改进11维接吻数下界，展示去中心化科学发现新范式。

RAG over Thinking Traces Can Improve Reasoning Tasks 85

Tags: RAG 推理优化 研究
Source: arXiv Computation and Language | 阅读原文

[摘要]
新研究提出用问题求解的中间思考轨迹替代文档进行检索，显著提升大模型在AIME、LiveCodeBench等推理任务上的表现，挑战了RAG对推理无益的共识。

CodeAlchemy: Synthetic Code Rewriting at Scale 82

Tags: 大模型 代码生成 合成数据 推理优化
Source: arXiv Computation and Language | 阅读原文

[摘要]
CodeAlchemy提出大规模合成代码数据生成框架，通过5种策略生成500B+ tokens和350B推理tokens，使3B模型在代码任务上超越27B模型，揭示前沿模型语义理解差距。

Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models 82

Tags: AI安全 推理 对齐
Source: arXiv Computation and Language | 阅读原文

[摘要]
研究揭示推理优化可能破坏模型对齐，导致毒性、偏见、隐私泄露等安全风险上升。

Anthropic 研究：AI 数小时内即可从安全补丁构建漏洞利用 80

Tags: AI安全 研究 公司动态
Source: AI HOT 精选 | 阅读原文

[摘要]
Anthropic研究显示其AI模型数小时内即可从安全补丁自动生成漏洞利用代码，成本仅数千美元，威胁传统补丁机制，凸显AI安全新风险。

华为云发布全球首个端到端具身AI平台CloudRobo 80

Tags: 产品发布 机器人 大模型 华为云
Source: AI HOT 精选 | 阅读原文

[摘要]
华为云发布全球首个端到端具身AI平台CloudRobo，覆盖数据、模型到部署全生命周期，推动人形机器人快速开发与落地。

Rank Collapse, Fixed Points, and the Renormalization Group Structure of MLP Residual Networks 80

Tags: AI研究 模型理论 深度学习 残差网络
Source: arXiv Statistics - Machine Learning | 阅读原文

[摘要]
研究首次定量证实MLP残差网络实现选择性的重正化群粗粒化过程：残差流秩单调下降且与输入谱结构相关。

Magnetar用数百AI智能体替代分析师 80

Tags: 智能体 公司动态 金融AI
Source: AI HOT 精选 | 阅读原文

[摘要]
对冲基金Magnetar Capital用数百AI智能体替代人类分析师进行股票研究，AI负责搜索、分析、推荐，人类仅批准交易，展现金融业AI落地新进展。

SPACR: Single-Pass Adaptive Training of Uncertainty-Aware Conformal Regressors 80

Tags: 不确定性量化 共形预测 训练方法
Source: arXiv Statistics - Machine Learning | 阅读原文

[摘要]
SPACR提出单次训练共形回归器，直接优化区间效率和有效性，无需多置信水平重训练，性能更优且计算成本更低。

It Takes One to Bias Them All: Breaking Bad with One-Shot GRPO 80

Tags: 大模型 AI安全 研究进展
Source: arXiv Computation and Language | 阅读原文

[摘要]
研究发现，仅用单一样本进行GRPO训练即可系统性破坏大模型对齐，引发刻板印象偏见，揭示后训练安全重大漏洞。

Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2 80

Tags: 模型压缩 推理优化 SSM 高效部署
Source: arXiv Computation and Language | 阅读原文

[摘要]
提出密度场状态空间模型(DF-SSM)，将Mamba-2 1.3B模型压缩至1-bit，大小减少9.7倍，推理加速21.4倍，仅需32M tokens蒸馏，性能接近BitMamba-2，兼具高效部署与知识组织分析价值。

PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning 80

Tags: 训练方法 模型蒸馏 MoE 大模型
Source: arXiv Computation and Language | 阅读原文

[摘要]
提出PADD框架，实现从无路由密集教师到MoE学生的知识蒸馏，在数学推理上超越强基线，相同推理成本下学生匹配或超越教师。

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models 80

Tags: 推理优化 模型发布 开源生态 长上下文
Source: arXiv Computation and Language | 阅读原文

[摘要]
提出Prefilling-dLLM，针对扩散语言模型长上下文推理的免训练预填充-解码分离框架，通过分块缓存和稀疏注意实现9-28倍加速，并解决中间丢失问题。

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization 80

Tags: 模型发布 推理优化 大模型
Source: arXiv Computation and Language | 阅读原文

[摘要]
提出LC-QAT，一种用于大语言模型的2比特权重向量量化感知训练框架，通过线性约束向量量化实现全微分端到端优化，数据效率高，性能超越当前最优方法。

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It 80

Tags: 模型训练 推理优化 长上下文
Source: arXiv Computation and Language | 阅读原文

[摘要]
发现CoT微调会破坏混合线性注意力模型的长上下文记忆，提出QK-Restore方法零训练代价恢复长程检索能力且保持推理性能。

2026-06-11 ​

突发：Google 因模型幻觉被判负有法律责任 85 ​

百度百舸联合复旦提出LU-KV框架，被ICML 2026录用 85 ​

摩尔线程开源 MusaCoder 代码大模型，9B/27B 参数基于国产 GPU 全链路训练 85 ​

KCSAT-ML: Probing Reasoning Models with Nationwide-Cohort Human Difficulty 85 ​

ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models 85 ​

Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries 85 ​

RAG over Thinking Traces Can Improve Reasoning Tasks 85 ​

CodeAlchemy: Synthetic Code Rewriting at Scale 82 ​

Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models 82 ​

Anthropic 研究：AI 数小时内即可从安全补丁构建漏洞利用 80 ​

华为云发布全球首个端到端具身AI平台CloudRobo 80 ​

Rank Collapse, Fixed Points, and the Renormalization Group Structure of MLP Residual Networks 80 ​

Magnetar用数百AI智能体替代分析师 80 ​

SPACR: Single-Pass Adaptive Training of Uncertainty-Aware Conformal Regressors 80 ​

It Takes One to Bias Them All: Breaking Bad with One-Shot GRPO 80 ​

Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2 80 ​

PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning 80 ​

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models 80 ​

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization 80 ​

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It 80 ​

2026-06-11

突发：Google 因模型幻觉被判负有法律责任 85

百度百舸联合复旦提出LU-KV框架，被ICML 2026录用 85

摩尔线程开源 MusaCoder 代码大模型，9B/27B 参数基于国产 GPU 全链路训练 85

KCSAT-ML: Probing Reasoning Models with Nationwide-Cohort Human Difficulty 85

ParaBridge: Bridging Paralinguistic Perception and Dialogue Behavior in Speech Language Models 85

Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries 85

RAG over Thinking Traces Can Improve Reasoning Tasks 85

CodeAlchemy: Synthetic Code Rewriting at Scale 82

Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models 82

Anthropic 研究：AI 数小时内即可从安全补丁构建漏洞利用 80

华为云发布全球首个端到端具身AI平台CloudRobo 80

Rank Collapse, Fixed Points, and the Renormalization Group Structure of MLP Residual Networks 80

Magnetar用数百AI智能体替代分析师 80

SPACR: Single-Pass Adaptive Training of Uncertainty-Aware Conformal Regressors 80

It Takes One to Bias Them All: Breaking Bad with One-Shot GRPO 80

Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2 80

PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning 80

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models 80

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization 80

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It 80