January 03, 2021
Just like most people I know (and at least half the world) the pandemic took hold of a big chunk of my life. Fortunately, I was able to cope with most of it but not with this blog which I haven’t given a second of notice (I’ve only published an article on talkdesk’s).
For this year I’m hoping on giving it more time and focus. I’ll give my best to keep it related to data engineering, analytics, and maybe some software engineering in general. As such, I’ve divided the article into the following topics.
In 2019 I set out on learning a lot but I didn’t have a specific goal. Just some languages and frameworks. What was the consequence? I quickly lost my motivation.
For 2021 I’ve decided that I’ll focus on improving data quality through data observability (you can read a bit about it in this great article).
When looking into some of the biggest gaps in Business Intelligence I’ve come to realize that the industry has been missing on a lot of DevOps and software engineering techniques such as good CI/CD, data tests and data lineage. Another missing point is the use of these pipeline’s metadata and the infrastructure’s metrics to implement a data-driven mindset. Some great tools have started to appear like Great Expectations and DBT to improve our data pipelines and I’m hoping that, by the end of the year I’ve got these in production in pair with airflow metrics (my colleague has published an article with an overview of Talkdesk’s BI architecture).
For 2020 I’ve reinforced my knowledge of Python’s internals. For 2021 I’ll be reading more books and articles and I’ll broaden my knowledge by learning Scala, a functional programming language heavily used in data teams.
As a bonus, I’d like to read more about Flink, Kafka and Rust.
Through the years I’ve tried multiple to-do lists. In 2020 I was trying to write everything on Notion (which I find great) but I found myself unable to make it work as a task app. At the middle of the year, as I was grasping with this issue, I read an article that set me on the right track (not sure where to find it sadly). Just write everything on your calendar!
And it worked. I’m using google calendar and, for now, I have the following flow.
With this flow, I feel in better control of my day and don’t feel bad when I see my huge task list. For this year I’m hoping that this will hold.
2020 wasn’t a bad year.
One of my personal goals was to work remotely and although I got there through a pandemic, this type of work suits me. However, for the rest of my goals I’ve slacked and 2021 needs to be a turning point. I’m hoping that by writing and talking more I’ll become more accountable and, as consequence, realize them.
I'm José Cabeda, a data engineer focused on improving data systems and educating on how to use them. I also do a lot of planning and read as much as I can.