7 tutorial series · 2 projects · 14 posts
The xlfilldown tool is a CLI and Python library designed for processing Excel data into SQLite or new Excel workbooks. It efficiently handles hierarchical and independent fill modes to maintain data integrity. Key features include row hashing, constant-memory processing, and raw ingestion options. Users can choose between CLI commands or a Python API for seamless integration into their workflows.
Sep 23, 2025I've put together a python package to automate the iptables firewall commands we've covered. It's available NOW on pypi.org
Oct 11, 2022pgmonkey is a handy Python library that simplifies PostgreSQL connection management with just one YAML config file. It supports multiple connection types—like sync, async, and pooled—while handling caching and lifecycle tasks seamlessly. Ideal for projects with mixed connection needs, it saves developers from configuration chaos. Worth checking out!
Feb 17, 2026Tired of dealing with messy spreadsheets that complicate your data workflow? Enter xlfilldown — the ultimate solution for data engineers seeking clean, queryable data. This powerful tool efficiently fills hierarchical data, ensuring context is preserved without memory overload. With a simple command, you can cleanse and streamline large Excel files, saving precious time while avoiding tedious manual fixes. Embrace hassle-free data management today!
Feb 11, 2026Excel may appear simple, but it conceals a daunting data challenge: the hierarchical spreadsheet. Common among analysts, it presents issues like merged cells and hidden hierarchies that frustrate data processing. Enter xlfilldown—a powerful solution that transforms chaotic Excel files into clean, structured datasets for seamless analysis. By bridging the gap between visual reports and data integrity, xlfilldown revolutionizes the ETL process for data professionals.
Oct 13, 2025The content discusses the performance and reliability of the xlfilldown tool, which efficiently processes large Excel datasets without excessive memory usage. A stress test processing 1,000,000 rows demonstrated rapid speeds—13,800 rows/sec for Excel and 44,700 rows/sec for SQLite—while maintaining deterministic and auditable output. This enhances data integrity and workflow trust.
Oct 6, 2025Pleasantly surprised when revisiting python packaging with setuptools and TOML in 2024.
Feb 19, 2024Using Git Submodules with your python packages helps stop you from re-inventing the wheel. Make one wheel, and use it in every project. Let me show you how.
Nov 25, 2022You might need to store data at the package level, take a look at the following package structure. I am going to show you how to create, read and write to a database at the package level. This means that when a user installs your package, you could store data in the python package. We’re...
Nov 23, 2022There may be occasions where you want to store application data and configuration files outside of the reach of your users, and store it directly in the package files themselves.
Nov 19, 2022The python package ‘argparse’ is a highly versatile and customizable command line parser. I’m going to show you a handful of examples that should cover most of your use cases, if not you can reference the full ‘argparse’ documentation here. Come on – let’s take a look. Prerequisites Although you could apply the examples we...
Sep 6, 2022Wouldn't you like to be able to wrap up all of your cool code in to a package, and then call your code from the command line using a user friendly command name? Sure you do! Come on, let me show you how.
Sep 1, 2022There are a couple of different ways to install python packages. Come on, let's take a look at them.
Sep 1, 2022I've got a good one for you today. What a ride. It seems I have caught "setuptools" at a transitional moment. Installing python packages in editable / development mode with TOML "pyproject.toml" files is now possible - but a little rough around the edges for now. Come on - let's take a look.
Aug 31, 2022You need to know how to package your python projects properly. Here I show you how to package, and submit your package to your very own pypi.org repository. I create the package "pkgexample" version 1.0.1 , your package name will vary if you follow along.
Aug 28, 2022This content provides a comprehensive guide on authenticating Google Cloud service accounts for Python API calls, detailing three methods: explicit file loading, Application Default Credentials, and manual token extraction. It emphasizes the importance of service accounts, API enablement, and maintaining security with key files, while offering practical coding examples and best practices.
Jun 16, 2022
You must be logged in to post a comment.