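Ask HN: How do you build a service for FOSS projects when the ToS prohibits scraping?
1 point • by ATechGuy • 9 months ago
I'm working on a service for FOSS (free and open source software) developers to help them enforce code license compliance and make their projects more sustainable.
The challenge: many websites' Terms of Service explicitly prohibit scraping, crawling, or automation. At the same time, the information needed (repos, dependencies, metadata) is often available only through those sites.
For those who've built tools around open source ecosystems:
* How do you navigate ToS restrictions while still delivering value to users?
* Do you focus on official APIs only, even if they're limited?
* Are there established legal/technical best practices for this situation?
* How do you balance ToS compliance with the mission of supporting FOSS?
Curious to hear what others have done (or seen work) in this space.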