Ask HN: Releasing code under the AGPLv3, but want to block LLM recreation?

3 points by zionsati 17 days ago
I am preparing to release a software project under the AGPLv3. The goal is traditional copyleft reciprocity: if you use it or host it, share your changes.

However, I am realistic about the current legal landscape. Big tech corps are treating public code as free raw material for LLM training under the banner of "Fair Use". I am concerned that a company will ingest my codebase and use an LLM to effectively launder the logic, letting their users prompt for a clean, closed-source recreation of my software without triggering the AGPL.

Is there a licence designed specifically to prevent this while still keeping OSS healthy and alive? Is there an llm.txt / robots.txt that LLM scrapers respect? I feel that the whole OSS model is under threat here, even more than before (e.g. big corps earn billions from Linux instances without paying any software licensing cost, yet they are more than happy to charge others for their own OS).