Tell HN: Forget selectors and screenshots. The agentic web lives in your shell

1 point by keepamovin 9 days ago
These old ways are too heavy. Full self-browsing doesn't require Elon Musk vision processing.

It just requires Agentish - the agent's native tongue, the LLM's lingua franca - frickin plain text.

And honesty. About what it can do (everything on the web, besides stuff only you can do), and what it can't do, but you can: MFA, captcha, login.

An agent skill with smart guardrails and a well-designed Unix philosophy CLI tool is enough to power any task on the web.

You can try it too. Here are some things I've thrown at it and it's done:

- *Find at least 100 relevant tweets and craft apt replies that promote WebCLI to people experiencing the pain it solves.* (Grok Build with Composer 2.5 Fast)

- *Compare flights from SFO to DC mid-afternoon across a couple of providers like Google Flights and Kayak, etc. Find the cheapest one with no stops. Fill in my details and book it but stop at payment.* (Codex 5.5 high)

- *Find some fun Lego products across Amazon, Walmart, Alibaba and lego.com and find the coolest set or large quantity of blocks at the best price and take it all the way to checkout, filling in details.* (Claude Sonnet 4.6)

And many more. The agents always figure it out. No screenshots, no selectors, just raw text and numbered references for actions, with honest validation and a bunch of useful surfaces.

The core loop is a simple OODA loop:

```
web inspect          # agent observes and orients
web do <ref> <opts>  # agent decides and acts
```

Repeat. Forever. That gets everything done. The tool is small enough and transparent enough, and agents are smart enough and persistent enough, that they always figure it out.

It's a new era of web task driving with intelligence. No more playwrights and puppets, no more robotic "auto-mation" beep boop. WebCLI is web improvisation, powered by agents' intelligence.

I want to keep building technology for agency. Imagine if you tried this tool and it saved you time and drudgery. Try it and then pay because it's valuable. You get a free, fully functional five-day trial per email domain, with just an email address. https://webcli.sh

Contact me if you have ideas for using it at scale.
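For anyone who wants to see the shape of the core loop described above, here is a minimal Python sketch. It is not WebCLI's own code. The only things taken from the post are the two commands, `web inspect` and `web do <ref> <opts>`. Everything else is an assumption made for illustration: that `web` is on your PATH and prints the page text to stdout, plus the hypothetical names `run_web`, `ooda_loop`, `decide` and `max_steps`. The post does not document the output format or the available options, so the sketch treats the inspect output as opaque text and leaves the choice of action to a caller-supplied function.

```python
import subprocess
from typing import Callable, List, Optional

# Hypothetical "decide" step: takes the text printed by `web inspect` and
# returns the arguments for `web do` (a ref followed by any options),
# or None when the task is finished or a human has to step in.
Decide = Callable[[str], Optional[List[str]]]


def run_web(args: List[str]) -> str:
    """Run the `web` CLI with the given arguments and return its stdout.

    Assumes a `web` executable is on PATH (an assumption, not verified here).
    """
    result = subprocess.run(
        ["web", *args], capture_output=True, text=True, check=True
    )
    return result.stdout


def ooda_loop(
    decide: Decide,
    run: Callable[[List[str]], str] = run_web,
    max_steps: int = 50,
) -> int:
    """Alternate `web inspect` and `web do` until decide() returns None.

    Returns the number of actions taken. max_steps is a safety cap added
    for this sketch; the post itself just says "Repeat. Forever."
    """
    steps = 0
    while steps < max_steps:
        page_text = run(["inspect"])   # observe and orient
        action = decide(page_text)     # decide
        if action is None:             # done, or hand back to the human
            break
        run(["do", *action])           # act
        steps += 1
    return steps
```

In practice `decide` is where the model sits: it reads the raw text, picks a numbered reference, and returns something like `["12"]`, or returns `None` when it reaches a step only you can do (MFA, captcha, login). Passing `run` as a parameter keeps the loop testable without a browser.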