Ask HN:哪里可以追踪 AI 模型训练成本的趋势?
2 分•作者: hedayet•8 个月前
我对训练 AI 模型(计算、能源、数据等)的成本随时间的变化很感兴趣。
是否有公开资源或数据集跟踪开源模型的训练成本(我猜这些数据很难获取,对于闭源模型来说,但如果我错了,我很乐意被纠正。)
我特别想了解哪些架构上的变化(例如,注意力机制变体、参数共享、混合专家模型)导致了主要的成本优化,而且不仅仅来自这些模型背后的公司,还包括任何训练或复现过这些模型的人。
查看原文
I'm curious how the cost of training AI models (compute, energy, data, etc) has changed over time.<p>Are there any public resources or datasets tracking training costs for open-weight models (I'm guessing this data is hard to get for closed models, but happy to be proved wrong.)<p>I'm especially interested in understanding which architectural changes (e.g., attention variants, parameter sharing, mixture-of-experts) have led to major cost optimizations, and NOT just from the companies behind these models, but from anyone who has trained or replicated them.