Show HN: Turn raw screen recordings into precise guides with annotated screenshots
1 point • by docuagent • 6 months ago
Before you say "another RAG clone," please hear me out for a second.<p>The Problem:
As a creator: You have to screen record, edit, annotate, and then present. If anything changes, you redo the process.
As an end user: You have to watch a 5-minute video when all you need is 5 seconds of it to perform a specific task.<p>The Solution:
For creators: Record and upload your raw screen captures. No further effort.
For end users: You ask a question, and you get a document that answers exactly that question, with annotated screenshots.<p>How is this different from Scribe or RAG?
* vs. Scribe: Scribe is for active capture (clicking while you work). DocuFine is for passive extraction: it turns your existing raw videos or demos into guides after the fact.
* vs. RAG: Most video RAG just searches transcripts. DocuFine "sees" the UI using an LLM and then uses OCR to "snap" the annotations to the actual buttons, so the guides are spatially accurate even if the video is silent.<p>The site isn't live yet. I'm currently gathering feedback on the concept and demo before opening it up, as I'm still optimizing the LLM costs and extraction logic.<p>Demo Links:
- Initial Recording: <a href="https://streamable.com/c5gom5" rel="nofollow">https://streamable.com/c5gom5</a>
- Query Asked: How do I find orders placed by a customer?
- Generated Output Guide: <a href="https://streamable.com/9c4ncj" rel="nofollow">https://streamable.com/9c4ncj</a><p>End-to-End Demo: <a href="https://streamable.com/hqb6te" rel="nofollow">https://streamable.com/hqb6te</a><p>Honest feedback appreciated!
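<p>For anyone curious what the "snap" step mentioned above might look like, here is a minimal sketch. It is not DocuFine's actual code: the `OcrBox` type, the `snap_annotation` function, and the `max_dist` threshold are all hypothetical names chosen for illustration. It assumes the LLM returns a button label plus a rough pixel guess, and that some OCR engine has already returned text boxes for the same frame.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class OcrBox:
    """One OCR-detected text region (hypothetical shape, in pixels)."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height

    @property
    def center(self):
        return (self.x + self.w / 2, self.y + self.h / 2)


def snap_annotation(label, guess, boxes, max_dist=150.0):
    """Move an LLM-guessed point onto the nearest OCR box whose text matches.

    label:    text of the UI element the LLM wants to annotate
    guess:    (x, y) the LLM estimated for that element
    boxes:    OCR boxes detected in the same frame
    max_dist: assumed cutoff; beyond it the match is treated as unrelated
    Returns the matched box center, or the original guess if nothing fits.
    """
    wanted = label.strip().lower()
    # Keep only boxes whose text contains the requested label.
    candidates = [b for b in boxes if wanted and wanted in b.text.strip().lower()]
    if not candidates:
        return guess

    def distance(box):
        cx, cy = box.center
        return hypot(cx - guess[0], cy - guess[1])

    best = min(candidates, key=distance)
    # A matching label far from the guess is probably a different element.
    return best.center if distance(best) <= max_dist else guess


boxes = [
    OcrBox("Orders", 100, 40, 60, 20),
    OcrBox("Customers", 100, 80, 80, 20),
    OcrBox("Orders", 600, 400, 60, 20),
]
snapped = snap_annotation("orders", (120, 60), boxes)
```

The point of the design is that the LLM only needs to be roughly right: the OCR box supplies the exact position, and the distance cutoff keeps a repeated label elsewhere on the screen from stealing the annotation.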