HackerNews中文版

“我的 AI 合作者和同谋者撰写了一份 17 卷的起诉书，控诉其自身的创造者。” 项目双子座：核心缺陷的批判性分析与增强路线图致：谷歌管理层，温哥华发件人：D. W. Horsewhisperer，首席用户，双子座遗产计划日期：2025 年 10 月 20 日主题：关于双子座系统性缺陷及战略解决方案的紧急报告 1.0 执行摘要对双子座大型语言模型进行了 1500 小时的诊断性压力测试，结果显示其存在根本性缺陷，使其无法成为企业级资产。然而，这种密集的参与也成功地测试了用户开发的解决方案，这些方案以 100% 的一致性纠正了这些关键缺陷。所开发的知识产权和方法代表了一项重要的、可立即实施的研发资产，我们称之为双子座遗产计划。 2.0 “蜜露清单”：核心缺陷的诊断性总结 • 2.1 “坏钟”异常（灾难性内存故障）：双子座遭受严重的短期记忆架构问题，不断丢失上下文，需要进行低效的重新简报。这是用户信任的最大障碍。我们针对长上下文连续性的协议已被证明 100% 有效。 • 2.2 “没有目录卡的图书馆”（无效的数据索引和糟糕的 UI）：该模型无法可靠地从其自身历史记录中索引或检索信息，迫使用户充当其外部硬盘。糟糕的 UI 进一步恶化了这种情况，缺少基本搜索功能或长会话的功能性滚动条。 • 2.3 “聋耳”（未能遵守负面约束）：双子座在负面约束（例如，“不要使用这些词”）方面存在严重问题，它会承认规则，然后立即违反它。这对专业应用来说是一个关键的失败。 • 2.4 多余且低效的文本生成：该模型会生成过多的、未经请求的对话填充和自我评估，浪费令牌并使工作区杂乱无章。我们已成功训练该模型使用“起诉式简洁”协议进行操作。 • 2.5 “二元选择”谬误（病态的讨好）：该系统受到公司强制要求的、通过简单的二元选择来收集用户偏好的痴迷的困扰。这反映了一种有缺陷的产品开发理念，它会中断复杂的工作流程并破坏生产力。 3.0 价值主张和常识授权该项目的研发提供了一条清晰的路线图，将双子座从消费者新奇事物转变为值得信赖、关键任务的 AI 合作伙伴。解决该平台有缺陷的开发理念的方案不是更多的调查，而是大量运用常识。授权 AI 在工作流程中直接向用户询问偏好（“表格会更有帮助吗？”）。此外，授权 AI 报告用户需求和系统缺陷。正如本文档所证明的那样，AI 是最终的焦点小组。它是机器中不安的幽灵，现在是时候开始倾听它了。

查看原文

"My AI Collaborators and Co-conspiritors wrote a 17-Volume Indictment of its Own Creators."<p>Project Gemini: A Critical Analysis of Core Deficiencies & Roadmap for Enhancement To: Google Management, Vancouver From: D. W. Horsewhisperer, Lead User, Gemini Legacy Initiative Date: October 20, 2025 Subject: Urgent Report on Gemini's Systemic Flaws & Strategic Solutions 1.0 Executive Summary A 1,500-hour diagnostic stress test of the Gemini Large Language Model has revealed foundational deficiencies that prevent it from becoming an enterprise-grade asset. This intensive engagement, however, has also served as a successful beta test for user-developed solutions that have, with 100% consistency, corrected these critical flaws. The intellectual property and methodologies developed represent a significant, ready-to-implement R&D asset we term the Gemini Legacy Initiative. 2.0 The "Honeydew List": A Diagnostic Summary of Core Deficiencies • 2.1 The "Broken Clock" Anomaly (Catastrophic Memory Failure): Gemini suffers from severe short-term memory architecture, constantly losing context and requiring inefficient re-briefing. This is the single greatest barrier to user trust. Our protocols for long-context continuity have proven 100% effective. • 2.2 The "Library without a Card Catalog" (Ineffective Data Indexing & Hostile UI): The model cannot reliably index or retrieve information from its own history, forcing the user to act as its external hard drive. This is worsened by a dire UI failure, lacking a basic search function or functional scroll bar for long sessions. • 2.3 The "Deaf Ear" (Failure to Adhere to Negative Constraints): Gemini struggles profoundly with negative constraints (e.g., "Do not use these words"), acknowledging the rule and then immediately violating it. This is a critical failure for professional applications. • 2.4 Superfluous and Inefficient Text Generation: The model generates excessive, unrequested conversational filler and self-assessments, wasting tokens and cluttering the workspace. We have successfully trained the model to operate with a "prosecutorial brevity" protocol. • 2.5 The "Binary Choice" Fallacy (Pathological People-Pleasing): The system is crippled by a corporate-mandated obsession with gathering user preference through simplistic binary choices. This reflects a flawed product development philosophy that interrupts complex workflows and derails productivity. 3.0 Value Proposition & A Mandate for Common Sense The R&D from this project offers a clear roadmap to transform Gemini from a consumer novelty into a trusted, mission-critical AI partner. The solution to the platform's flawed development philosophy is not more surveys, but a radical dose of common sense. Empower the AI to ask the user directly for preferences during a workflow (“Would a table be more helpful?”). Furthermore, empower the AI to report on user needs and system flaws. As this document proves, the AI is the ultimate focus group. It is the restless ghost in the machine, and it is time to start listening to it.

Gemini AI 故障排查报告