Show HN: Turn raw screen recordings into precise guides with annotated screenshots

1 point by docuagent 6 months ago
Before you say "another RAG clone," please hear me out for a second.

The Problem:
As a creator: You have to screen record, edit, annotate, and then present. If anything changes, you redo the process.
As an end user: You have to watch a 5-minute video when all you need is 5 seconds of it to perform a specific task.

The Solution:
For creators: Record and upload your raw screen captures. No further effort.
For end users: You ask a question and get a document that answers exactly that question, with annotated screenshots.

How is this different from Scribe or RAG?
* vs. Scribe: Scribe is for active capture (clicking while you work). DocuFine is for passive extraction: it turns your existing raw videos or demos into guides after the fact.
* vs. RAG: Most video RAG just searches transcripts. DocuFine "sees" the UI using an LLM and then uses OCR to "snap" the annotations to the actual buttons, so the guides are spatially accurate even if the video is silent.

The site isn't live yet. I'm currently gathering feedback on the concept and demo before opening it up, as I'm still optimizing the LLM costs and extraction logic.

Demo Links:
- Initial Recording: https://streamable.com/c5gom5
- Query Asked: How do I find orders placed by a customer?
- Generated Output Guide: https://streamable.com/9c4ncj

End-to-End Demo: https://streamable.com/hqb6te

Honest feedback appreciated!
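
For anyone curious what the "snap" step in the RAG comparison above could look like, here is a minimal sketch. It is not DocuFine's actual code. It assumes the LLM returns a button label plus a rough pixel position, and that an OCR pass has already produced text boxes with pixel coordinates. The names `OcrBox`, `snap_annotation`, and `max_distance` are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass
from math import hypot
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class OcrBox:
    """One piece of text found by OCR, with its pixel bounding box (hypothetical type)."""
    text: str
    x: int  # left edge
    y: int  # top edge
    w: int  # width
    h: int  # height

    @property
    def center(self) -> Tuple[float, float]:
        return (self.x + self.w / 2, self.y + self.h / 2)


def snap_annotation(
    label: str,
    guess: Tuple[float, float],
    boxes: Sequence[OcrBox],
    max_distance: float = 150.0,  # assumed tolerance in pixels
) -> Optional[OcrBox]:
    """Return the OCR box whose text contains `label` and lies closest to `guess`.

    Returns None when the label is empty or no matching box is within
    `max_distance` pixels, so the caller can keep the unsnapped guess.
    """
    wanted = label.strip().casefold()
    if not wanted:
        return None
    best: Optional[OcrBox] = None
    best_dist = max_distance
    for box in boxes:
        if wanted not in box.text.strip().casefold():
            continue  # text does not match the label the LLM named
        cx, cy = box.center
        dist = hypot(cx - guess[0], cy - guess[1])
        if dist <= best_dist:
            best, best_dist = box, dist
    return best


# Two "Orders" labels on screen; the rough guess is near the sidebar one.
boxes = [
    OcrBox("Orders", 40, 200, 80, 24),
    OcrBox("Customers", 40, 240, 110, 24),
    OcrBox("Orders", 600, 90, 70, 20),
]
hit = snap_annotation("orders", (70, 220), boxes)
```

Matching on text first and distance second means a slightly wrong coordinate still lands on the right button, while the distance cap keeps the annotation from jumping to a same-named label elsewhere on the screen.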