High Precision Folder Matching (HPFM) for faster open source scanning
1 point • by scanosss • 7 months ago
SCANOSS just added a new feature to its CLI (scanoss-py): High Precision Folder Matching (HPFM).

Instead of matching every file individually, HPFM analyses the full structure of a scanned folder (file names, directory layout, extensions, and contents) to quickly determine which open source component it most likely represents.

Think of it like scanning a car: traditional scanning lists all the parts; HPFM tells you “this is a Peugeot 2008.”

Why it’s useful:
- Speeds up scanning in high-volume environments
- Reduces noise by focusing on the dominant component
- Can improve the accuracy of follow-up scans by providing a “hint”
- Helps detect altered or incomplete components

Available now in scanoss-py.
Docs/GitHub: https://github.com/scanoss/scanoss.py
CLI: scanoss-py scan --folder-hashing <path>

Would love to hear feedback, suggestions, or use cases this might help with.
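
To make the folder-level idea concrete, here is a minimal Python sketch of one way such matching could work in principle. It is not the SCANOSS algorithm and does not use the scanoss-py API: the function names (`folder_signature`, `jaccard`, `best_match`), the feature scheme, and the use of Jaccard similarity are all assumptions chosen for illustration. Each folder is reduced to a set of features drawn from the signals mentioned above (relative paths, extensions, content hashes), and the scanned folder is compared against a small dictionary of known component signatures.

```python
import hashlib
from pathlib import Path


def folder_signature(root):
    """Reduce a folder to a set of features (hypothetical scheme, for illustration).

    Features cover the signals named in the post: directory layout and file
    names (relative paths), extensions, and contents (SHA-1 of each file).
    """
    root = Path(root)
    features = set()
    for path in root.rglob("*"):
        if not path.is_file():
            continue
        rel = path.relative_to(root).as_posix()
        features.add("path:" + rel)
        features.add("ext:" + path.suffix.lower())
        features.add("sha1:" + hashlib.sha1(path.read_bytes()).hexdigest())
    return features


def jaccard(a, b):
    """Jaccard similarity of two feature sets, in the range 0.0 to 1.0."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def best_match(scanned, known):
    """Return (component_name, score) for the closest known signature.

    `known` maps a component name to a signature built by folder_signature.
    """
    return max(
        ((name, jaccard(scanned, sig)) for name, sig in known.items()),
        key=lambda pair: pair[1],
    )
```

Because the score is computed over the folder as a whole, a copy with a few edited or missing files still lands closest to its source component with a score below 1.0, which is the kind of signal that could flag an altered or incomplete component.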