Dealing With Excel Data in PySpark

Have you ever asked yourself, "how do I read in 10,000 Excel files and process them using Spark?" I hope not...it sounds like a terrible task...but in case you have, it just so happens I might have an approach your interested in.

General Approach

PySpark does not …



Replacing the Cron in AWS

Like most apps on the internet, the stuff that I write at Showroom Logic has scheduled tasks that must happen in a predictable fashion. In our case, we have some reports to run and deliver. Our app, like lots of internet apps, is distributed and runs in a Docker container …



JetBrians DataGrip Review

Recently, JetBrains released a new product called DataGrip. It's basically IntelliJ for databases, and so far, I love it! Granted: I'm biased. I do all of my Java and Python development using IntelliJ and PyCharm, respectively. That being said, I think that even if I were still using some other …