Show HN: VAERS DuckDB 数据库
2 分•作者: yehosef•9 个月前
我一直在使用 VAERS 数据库,最近将其导入到 DuckDB 中进行本地分析。该数据集存在各种问题,我尝试修复以提高数据集的质量。
导入脚本位于 <a href="https://github.com/yehosef/vaers-duckdb" rel="nofollow">https://github.com/yehosef/vaers-duckdb</a>
这里提供一个现成的数据文件(约 3GB)<a href="https://drive.google.com/file/d/1d3wRRr2UFvCYR9r7J5XBym2dYKTsMKtF/view?usp=sharing" rel="nofollow">https://drive.google.com/file/d/1d3wRRr2UFvCYR9r7J5XBym2dYKT...</a>
我计划添加一些仪表板,类似于我最初的 Elasticsearch 项目。我还想添加一些嵌入,用于向量搜索。 正在进行中(WIP)。
我很想听听您的想法!
查看原文
I've been playing with the VAERS database for a while and I just got through importing it into duckdb for local analytics. The data set has various problems that I try to fix to improve the quality of the dataset.<p>The import scripts are <a href="https://github.com/yehosef/vaers-duckdb" rel="nofollow">https://github.com/yehosef/vaers-duckdb</a><p>A ready data file is available here (~3GB) <a href="https://drive.google.com/file/d/1d3wRRr2UFvCYR9r7J5XBym2dYKTsMKtF/view?usp=sharing" rel="nofollow">https://drive.google.com/file/d/1d3wRRr2UFvCYR9r7J5XBym2dYKT...</a><p>I'm planning to add some dashboards like my original elasticsearch project. I would also like to add some embeddings for vector search. WIP<p>I'd love to hear what you think!