用于加速开源扫描的高精度文件夹匹配 (HPFM)

1作者: scanosss7 个月前
SCANOSS 为其 CLI (scanoss-py) 增加了一项新功能:高精度文件夹匹配 (HPFM)。 HPFM 不再逐个匹配每个文件,而是分析被扫描文件夹的完整结构——文件名、目录布局、扩展名和内容——以快速确定它最有可能代表哪个开源组件。 可以把它想象成扫描一辆汽车:传统的扫描会列出所有零件;而 HPFM 会告诉你“这是一辆标致 2008”。 它的用处: - 加快高容量环境中的扫描速度 - 通过专注于主要组件来减少噪音 - 通过提供“提示”来提高后续扫描的准确性 - 帮助检测被更改或不完整的组件 现已在 scanoss-py 中提供。 文档/GitHub:https://github.com/scanoss/scanoss.py CLI:scanoss-py scan --folder-hashing <path> 欢迎提供反馈、建议或此功能可能有所帮助的用例。
查看原文
SCANOSS just added a new feature to its CLI (scanoss-py): High Precision Folder Matching (HPFM).<p>Instead of matching every file individually, HPFM analyses the full structure of a scanned folder—file names, directory layout, extensions, and contents—to quickly determine which open source component it most likely represents.<p>Think of it like scanning a car: traditional scanning lists all the parts; HPFM tells you “this is a Peugeot 2008.”<p>Why it’s useful: – Speeds up scanning in high-volume environments – Reduces noise by focusing on the dominant component – Can improve the accuracy of follow-up scans by providing a “hint” – Helps detect altered or incomplete components<p>Available now in scanoss-py. Docs&#x2F;GitHub: https:&#x2F;&#x2F;github.com&#x2F;scanoss&#x2F;scanoss.py CLI: scanoss-py scan --folder-hashing &lt;path&gt;<p>Would love to hear feedback, suggestions, or use cases this might help with.