2026-03-01

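Locally run Qwen3.5 models approach Sonnet 4.5 performance, a tiny Transformer performs 10-digit addition, MCP output is cut by 98%, the lightweight image hash SplatHash debuts, Unsloth Dynamic 2.0 GGUFs bring quantization optimizations, and AI agents and cognitive debt prompt reflection.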

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers 96

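Tags: Large Language Models, Open-Source AI, Quantization, Context Length, Hybrid Architecture
Source: HackerNews | Read original

[Summary]
The Qwen3.5 series adds four new LLMs, three of which are open source and said to outperform GPT-5-mini and Claude Sonnet 4.5. They support million-token context windows and retain high accuracy under 4-bit quantization, bringing frontier-model capability down to consumer-grade GPUs.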
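To make the 4-bit claim concrete, the following is a rough back-of-the-envelope sketch of weight storage only. The parameter counts (122B and 35B) come from the headline; the function name `weight_memory_gb` is hypothetical, and the estimate assumes every weight is stored at the same bit width. Real quantized files add overhead for scales and mixed-precision layers, and the KV cache and activations are not counted.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (weights only)."""
    bytes_total = n_params * bits_per_weight / 8  # 8 bits per byte
    return bytes_total / 1e9


# Parameter counts taken from the headline; everything else is an assumption.
for name, n_params in [("122B", 122e9), ("35B", 35e9)]:
    fp16 = weight_memory_gb(n_params, 16)
    q4 = weight_memory_gb(n_params, 4)
    print(f"{name}: 16-bit ~{fp16:.1f} GB, 4-bit ~{q4:.1f} GB")
```

Under these assumptions the 4-bit weights come to roughly 61 GB for the 122B model and 17.5 GB for the 35B model, a quarter of the 16-bit size in each case.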

Smallest transformer that can add two 10-digit numbers 95

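Tags: Transformer, Deep Learning, Architecture Innovation, Small Models, Arithmetic Reasoning
Source: HackerNews | Read original

[Summary]
Through novel architecture design and training strategies, a Transformer with only 36 parameters reaches 100% accuracy on 10-digit addition, pushing the lower limit of what small models can do.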

Stop Burning Your Context Window – How We Cut MCP Output by 98% in Claude Code 94

Tags: AI Agent, MCP, Context Management, Sandboxing, Full-Text Search, Rust, Python, JavaScript, Security Isolation
Source: HackerNews | Read original

[Summary]
MCP Context Mode reduces AI tool output bloat via sandboxed execution and semantic indexing, cutting context usage by 98% through selective data capture and FTS5-powered search.
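The general idea of keeping bulky tool output out of the context window and returning only matching snippets can be sketched with SQLite's FTS5 extension. This is a minimal illustration, not the project's implementation: the table name `chunks`, the helper names `index_output` and `search`, the chunk size, and the sample log are all hypothetical, and it assumes the local SQLite build has FTS5 enabled.

```python
import sqlite3


def index_output(conn: sqlite3.Connection, source: str, text: str,
                 chunk_lines: int = 20) -> None:
    """Split raw tool output into line chunks and store them in an FTS5 table."""
    conn.execute("CREATE VIRTUAL TABLE IF NOT EXISTS chunks USING fts5(source, body)")
    lines = text.splitlines()
    for start in range(0, len(lines), chunk_lines):
        body = "\n".join(lines[start:start + chunk_lines])
        conn.execute("INSERT INTO chunks (source, body) VALUES (?, ?)", (source, body))
    conn.commit()


def search(conn: sqlite3.Connection, query: str, limit: int = 3):
    """Return (source, snippet) pairs for the best-matching chunks only."""
    rows = conn.execute(
        "SELECT source, snippet(chunks, 1, '[', ']', '...', 12) "
        "FROM chunks WHERE chunks MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


# Hypothetical tool output: 1,000 routine log lines and one error.
log_lines = [f"INFO request {i} ok" for i in range(1000)]
log_lines[500] = "ERROR database timeout on shard 7"
raw_output = "\n".join(log_lines)

conn = sqlite3.connect(":memory:")
index_output(conn, "server.log", raw_output)
hits = search(conn, "timeout")
returned = sum(len(snippet) for _, snippet in hits)
print(f"raw: {len(raw_output)} chars, returned to the model: {returned} chars")
```

Only the short snippet around the match would be handed back to the model; the full log stays in the index and can be queried again if more detail is needed.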

Show HN: SplatHash – A lightweight alternative to BlurHash and ThumbHash 94

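Tags: Image Hashing, Low-Latency Encoding, Cross-Language Consistency, Oklab Color Space, Fixed-Size Output
Source: HackerNews | Read original

[Summary]
SplatHash encodes an image into a fixed-length 16-byte hash and supports bit-for-bit identical decoding across languages. Decoding a 32×32 blurred preview takes only 0.067 ms, significantly outperforming ThumbHash and BlurHash.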

Unsloth Dynamic 2.0 GGUFs 94

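Tags: LLM Quantization, GGUF, Dynamic Quantization, Model Compression, Inference Optimization
Source: HackerNews | Read original

[Summary]
Unsloth Dynamic 2.0 GGUFs apply dynamic quantization across all layers, markedly improving the accuracy of low-precision LLM inference. They support multiple models and mainstream inference engines, and use an in-house calibration dataset of more than 1.5 million tokens to optimize chat performance.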

The Windows 95 user interface: a case study in usability engineering 92

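Tags: Windows 95, User Interface Design, Usability Engineering, Iterative Design, Human-Computer Interaction
Source: OSNews | Read original

[Summary]
The Windows 95 user interface team used iterative design and usability engineering, refining the interaction experience based on data from real user behavior. The work represents a high point of scientific, user-centered UI design in the mid-1990s.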

Verified Spec-Driven Development (VSDD) 92

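Tags: Software Engineering, AI-Assisted Development, Formal Verification, Development Process Innovation
Source: HackerNews | Read original

[Summary]
VSDD merges Spec, TDD, and VDD into a unified AI-driven development pipeline that starts from a formal specification, uses testing and adversarial verification to ensure code completeness, and leaves strategic decisions to humans.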

Show HN: Decided to play god this morning, so I built an agent civilisation 92

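Tags: Artificial Life, NEAT, Self-Evolution, Computational Ecology, Machine Consciousness
Source: HackerNews | Read original

[Summary]
Werld builds self-evolving computational life forms based on NEAT neural networks, with no preset goals and no injected human knowledge, to allow open-ended emergence of intelligence. It runs locally and constrains its agents with metabolic costs.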

Don't trust AI agents 92

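Tags: AI Security, Containerization, Trust Minimization, System Architecture
Source: HackerNews | Read original

[Summary]
AI agents should be treated as malicious by default. NanoClaw enforces a trust-minimizing architecture through container isolation, unprivileged execution, and ephemeral environments.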

Cognitive Debt: When Velocity Exceeds Comprehension 92

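Tags: Software Engineering, Artificial Intelligence, System Architecture, Cognitive Load
Source: HackerNews | Read original

[Summary]
AI-assisted development speeds up code output, but understanding lags behind, creating a hard-to-quantify "cognitive debt" that erodes system maintainability and reliability over the long term.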
