-
Notifications
You must be signed in to change notification settings - Fork 6
Home
Refinery is a tool to extract the data from excel spreadsheets (both in .xls
and .xlsx
format) in a declarative way. The idea is that you focus on what you want to extract, but not on how the data should be extracted.
At Vortexa, we receive tons of excel spreadsheets with important information. Unfortunately, these spreadsheets have different format and populated in many different ways depending on the data source. That is why we came up with the idea of the refinery
. The library allows you to do both extraction of the data and validation/transformation of the extracted data along the way.
This documentation is a step-by-step guide of all features, starting from the simple one and then covering more and more tricky cases. As the sample data we will use Champions League 2021 stats located in /examples
folder
- Basic table extraction
- Multiple tables extraction
- Multiple tables with the anchor extraction