Category: Data Wrangling
-
Staging Data from Semi-Structured Text Files to Structured Tables – The All Roads Project

Data Engineering requires the ability to do more than just pull and load files. You will need to be able to cleanse and prepare the source into a useable format. This article dives into what was done to stage the CCC data previously collected for The All Roads Project.
-
The All Roads Project – Collecting Heavenly Data
Every analytics project needs data! For the All Roads Project, the sources are plentiful so focusing the effort is essential. I discuss the sources and steps used to collect them.
-
Empty your cup then try awk
Introduction Whenever a new stream of data comes onto the work scene, it is rarely in a form that is ready to be loaded into a final target table. It is often in the form of a report, where the set might be pivoted and summarized. There will be groaning…
You must be logged in to post a comment.