Show HN: VAERS DuckDB database

2 points by yehosef 9 months ago
I've been working with the VAERS database for a while and just finished importing it into DuckDB for local analytics. The dataset has various problems, which the import tries to fix to improve data quality.

The import scripts are at https://github.com/yehosef/vaers-duckdb

A ready-made data file (~3GB) is available here: https://drive.google.com/file/d/1d3wRRr2UFvCYR9r7J5XBym2dYKTsMKtF/view?usp=sharing

I'm planning to add some dashboards like the ones in my original Elasticsearch project. I'd also like to add embeddings for vector search. It's still a work in progress.

I'd love to hear what you think!