


I'm working on a service for FOSS developers to help enforce code license compliance and make projects more sustainable.

The challenge: many websites' Terms of Service explicitly prohibit scraping, crawling, or automation. At the same time, the information needed (repos, dependencies, metadata) is often available only through those sites.

For those who've built tools around open source ecosystems:

* How do you navigate ToS restrictions while still delivering value to users?
* Do you focus on official APIs only, even if they're limited?
* Are there established legal/technical best practices for this situation?
* How to balance compliance with ToS and the mission of supporting FOSS?

Curious to hear what others have done (or seen work) in this space.

Ask HN: How do you build a service for FOSS projects when the ToS forbids scraping?