csvtidy: Open-Source CLI Tool Merges and Cleans CSV Files with Reusable Recipes
A developer has released csvtidy, a free, open-source command-line tool designed to automate repetitive CSV file cleaning and merging tasks. The tool allows users to save cleanup steps — such as removing duplicates, trimming whitespace, and normalizing dates — as reusable YAML recipe files that can be re-run each month without reconfiguration. Built on DuckDB, csvtidy streams data rather than loading entire files into memory, enabling it to handle CSV files larger than available RAM. It supports Unix-style piping, runs entirely locally to protect sensitive data, and is installable via pip. The project is MIT-licensed and available on GitHub, and serves as the open-source CLI counterpart to the developer's visual desktop tool, Kramata.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.
Discussion (0)
Log in to join the discussion and vote.
Log in