top of page


Microsoft SQL Server on Amazon Web Services Cloud
Today one of my customers asked me to help him with SQL Server installation using his AWS cloud account. So, it's a great opportunity to...

Vitaly Bruk
Oct 30, 20242 min read
1


Clean Data with Spark
Cleaning data is a very common task for data professionals. The data we read from source systems are sometimes corrupt, duplicated, or...

Chen Hirsh
Oct 16, 20241 min read
2


Monitor Databricks costs with the new Dashboard and budgets
As data engineers, we need to monitor the usage and costs of our data solutions. Databricks recently released tools to help us do that.

Chen Hirsh
Oct 6, 20241 min read
7


SQL Windows Functions might be non-deterministic
How using SQL Windows functions with non unique order column can cause indeterminate results

Chen Hirsh
Oct 1, 20241 min read
3


Databricks workflows for-each - not quite there yet
Databricks recently added a for-each task to their workflow capability. How does it work and what are its limitations?

Chen Hirsh
Sep 24, 20241 min read
8


Instant data replication with Databricks table cloning
Cloning tables in Databricks is a fast way to create replicated data for test proposes, or archiving.

Chen Hirsh
Sep 15, 20241 min read
2


Working with Excel files in Databricks
Working in Databricks, you can read and write Excel files, but you need to pay attention to some pitfalls.

Chen Hirsh
Sep 8, 20241 min read
1


A simple code source control for SQL Server
A simple source control for SQL server code objects like views and stored procedures.

Chen Hirsh
Sep 1, 20241 min read
1


How to display missing months in Power BI charts without showing future dates
When you’re using Power BI, you probably create line charts that show data by month pretty often. It usually works great, but what...

Alon Ohayon
Aug 28, 20243 min read
8


The mystery of the vanishing variable
A Python variable has vanished! Can you help the confused Data Engineer find out why?

Chen Hirsh
Aug 25, 20241 min read
3
bottom of page