{"id":622,"date":"2025-02-12T04:16:03","date_gmt":"2025-02-12T04:16:03","guid":{"rendered":"https:\/\/datadandies.nl\/?p=622"},"modified":"2025-02-12T04:16:03","modified_gmt":"2025-02-12T04:16:03","slug":"do-you-have-what-it-takes-to-be-a-data-engineer","status":"publish","type":"post","link":"https:\/\/datadandies.nl\/index.php\/2025\/02\/12\/do-you-have-what-it-takes-to-be-a-data-engineer\/","title":{"rendered":"Do you have what it takes to be a Data Engineer?"},"content":{"rendered":"\n<p>Currently reading \u201cData Pipelines Pocket Reference\u201d by <a href=\"https:\/\/www.linkedin.com\/in\/jamesdensmore\/\">James Densmore<\/a>.<br><br>Interesting little book.<br><br>Early in the book, James gives a short checklist of what a Data Engineer needs to be successful.<br><br>In this post I&#8217;m trying to see if I have what it takes to be a Data Engineer according to the prerequisites listed in the book.<br><br>&#8212; SQL and Data Warehousing &amp; Modeling Fundamentals &#8212;<br><br>I have developed several data warehouses. This included designing the data model. I have the most experience with the Kimball data modeling approach; however, I have also used the \u201cone big flat table\u201d approach when the situation lent itself to it.<br><br>Although I have worked with several data warehouses that were built using a data vault data modeling approach, I have never designed a data vault myself. This is a gap in my knowledge that I plan to fill at some point.<br><br>&#8212; Python and\/or Java &#8212;<br><br>I have experience with Python, but no experience whatsoever with Java. Up till now, I have not needed Java. 
It would be cool to pick up another programming language though, so whenever the need arises I would not mind learning it.<br><br>&#8212; Distributed computing &#8212;<br><br>I know of distributed computing platforms like Hadoop and Apache Spark, but I have never implemented one. This is another gap in my knowledge according to the book.<br><br>&#8212; Basic system administration (Linux command line) &#8212;<br><br>The skills the book lists under this header include:<br><br>&#8211;\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0Analyze application logs: I have some experience with this, especially with containers that run in the cloud; however, I would definitely need to increase my knowledge in this area.<br>&#8211;\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0Schedule cron jobs: I should be fine here.<br>&#8211;\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0Troubleshoot firewall and other security settings: I have done this many times but, honestly, it rarely feels easy. Some problems are as simple as not yet having been granted access to a certain object. Other problems are more complex: Entra ID is not synced with Snowflake, so a user has not been granted a certain role yet; or the proxy has not been set in the environment variables, so Power BI cannot connect to Snowflake.<br><br>&#8212; Goal-oriented mentality &#8212;<br><br>The book defines this mainly as soft skills: talking to data analysts, data scientists and stakeholders. I have a lot of experience with this particular requirement. I started out building reports in Power BI and Excel, which required a lot of communication with stakeholders. 
I find that talking often and thoroughly with the people for whom you develop a solution helps a lot with actually developing something that will be useful &amp; used.<br><br>How do you score on this checklist?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Currently reading \u201cData Pipelines Pocket Reference\u201d by James Densmore. Interesting little book. In the beginning James gives a summarized checklist of what a Data Engineer needs to have to be successful. In this post I&#8217;m trying to see if I have what it takes to be a Data Engineer according to the prerequisites as listed&hellip;<\/p>\n<p class=\"more-link\"><a href=\"https:\/\/datadandies.nl\/index.php\/2025\/02\/12\/do-you-have-what-it-takes-to-be-a-data-engineer\/\" class=\"themebutton\">Read More<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[62,61],"class_list":["post-622","post","type-post","status-publish","format-standard","hentry","category-blog","tag-bookdatapipelinespocketreference","tag-bookreview"],"_links":{"self":[{"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/posts\/622","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/comments?post=622"}],"version-history":[{"count":1,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/posts\/622\/revisions"}],"predecessor-version":[{"id":623,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/posts\/622\/revisions\/623"}],"wp:attachment":[{"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\
/media?parent=622"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/categories?post=622"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/datadandies.nl\/index.php\/wp-json\/wp\/v2\/tags?post=622"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}