cancel
Showing results for 
Search instead for 
Did you mean: 
Knowledge Sharing Hub
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

SumitSingh
by Contributor
  • 3824 Views
  • 7 replies
  • 12 kudos

From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications

In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications holds significan...

SumitSingh_0-1721402402230.png SumitSingh_1-1721402448677.png SumitSingh_2-1721402469214.png
  • 3824 Views
  • 7 replies
  • 12 kudos
Latest Reply
sandeepmankikar
New Contributor III
  • 12 kudos

As an additional tip for those working towards both the Associate and Professional certifications, I recommend avoiding a long gap between the two exams to maintain your momentum. If possible, try to schedule them back-to-back with just a few days in...

  • 12 kudos
6 More Replies
MichTalebzadeh
by Valued Contributor
  • 1363 Views
  • 3 replies
  • 0 kudos

Financial Crime detection with the help of Apache Spark, Data Mesh and Data Lake

For those interested in Data Mesh and Data Lakes for FinCrime detection:Data mesh is a relatively new architectural concept for data management that emphasizes domain-driven data ownership and self-service data availability. It promotes the decentral...

Knowledge Sharing Hub
data lakes
Data Mesh
financial crime
spark
  • 1363 Views
  • 3 replies
  • 0 kudos
Latest Reply
carrolbeau
Visitor
  • 0 kudos

It's great that you're focusing on financial crime detection with advanced technologies like Apache Spark, Data Mesh, and Data Lake. For those looking to dive deeper into criminal records and related data, tools like KY criminal lookup can provide es...

  • 0 kudos
2 More Replies
ThomazRossito
by Contributor
  • 2252 Views
  • 1 replies
  • 1 kudos

Post: Lakehouse Federation - Databricks

Lakehouse Federation - Databricks In the world of data, innovation is constant. And the most recent revolution comes with Lakehouse Federation, a fusion between data lakes and data warehouses, taking data manipulation to a new level. This advancement...

Knowledge Sharing Hub
data engineer
Lakehouse
SQL Analytics
  • 2252 Views
  • 1 replies
  • 1 kudos
Latest Reply
Freshman
New Contributor III
  • 1 kudos

Hey Quick Question, Can we use it for the production version ? We have application server as SQL server, we are planning to use lakehouse federation so we can bypass creating and maintaining 100 of workflows. as we a small dataset I am not too sure o...

  • 1 kudos
Shahram
by New Contributor II
  • 44 Views
  • 0 replies
  • 1 kudos

Hub Star Modeling 2.0 for Medalion Architecture

Excited to share my latest publication on arXiv!“Hub Star Modeling 2.0 for Medallion Architecture” https://arxiv.org/abs/2504.08788This new version builds on the original Hub Star Modeling approach, published last year, and now tailored for the Meda...

  • 44 Views
  • 0 replies
  • 1 kudos
genevive_mdonça
by Databricks Employee
  • 257 Views
  • 1 replies
  • 5 kudos

Handling Complex Nested JSON in Databricks Using schemaHints

When I first got into managing schemas in Databricks, it took me a while to realize that putting in a little planning up front could save me a ton of headaches later on.I was working with these deeply nested, constantly changing JSON files. At first,...

  • 257 Views
  • 1 replies
  • 5 kudos
Latest Reply
Advika
Databricks Employee
  • 5 kudos

Great tip @genevive_mdonça! schemaHints help avoid issues with evolving JSON data, making data processing more reliable and easier to maintain. Thanks for sharing.

  • 5 kudos
techgeorge
by New Contributor II
  • 281 Views
  • 1 replies
  • 0 kudos

Understanding Coalesce, Skewed Joins, and Why AQE Doesn't Always Intervene

In Spark, data skew can be the silent killer of performance. One wide partition pulling in 90% of the data?But even with AQE (Adaptive Query Execution) turned on in Databricks, skewness isn't always automatically identified— and here’s why.What Is co...

Data Skew.png
  • 281 Views
  • 1 replies
  • 0 kudos
Latest Reply
BigRoux
Databricks Employee
  • 0 kudos

@mark_ott , this question seems right up your alley. Care to comment?

  • 0 kudos
Emil_Kaminski
by Contributor II
  • 9689 Views
  • 3 replies
  • 5 kudos

Materials to pass Databricks Data Engineering Associate Exam

Hi Guys, I have passed it already some time ago, but just recently have summarized all the materials which helped me to do it. Pay special attention to GitHub repository, which contains many great exercises prepared by Databricks teamhttps://youtu.be...

  • 9689 Views
  • 3 replies
  • 5 kudos
Latest Reply
Alexa_Wadee
New Contributor II
  • 5 kudos

I passed my Databricks Data Engineering Associate exam after studying with https://bit.ly/4iaflcm. Their extensive collection of mock tests and Practice Software significantly boosted my score to 93%.

  • 5 kudos
2 More Replies
techgeorge
by New Contributor II
  • 270 Views
  • 0 replies
  • 0 kudos

How to train a Convolutional Neural Network on Databricks with Tensorflow and Keras

Here is how to trained a lightweight Convolutional Neuronal Network (CNN) to detect pneumonia from chest X-rays pictures on Azure Databricks. I promise no LLMs, no hype, just real-world deep learning:1. Built it with TensorFlow & Keras on Databricks2...

techgeorge_0-1743756172384.png
  • 270 Views
  • 0 replies
  • 0 kudos
shubham_meshram
by New Contributor II
  • 351 Views
  • 0 replies
  • 0 kudos

When Did the Data Go Wrong? Using Delta Lake Time Travel for Investigation in Databricks

I. IntroductionData pipelines are the lifeblood of modern data-driven organizations. However, even the most robust pipelines can experience unexpected issues: data corruption, erroneous updates, or sudden data drops. When these problems occur, quickl...

shubham_meshram_0-1743459167949.png
  • 351 Views
  • 0 replies
  • 0 kudos
pradeepvatsvk
by New Contributor III
  • 527 Views
  • 0 replies
  • 1 kudos

Inclusion of special characters while saving or downloading as a csv

Hi All, I have data which looks like this High Corona40% 50cl Pm £13.29  but when saving it as a csv it is getting converted into High Corona40% 50cl Pm £13.29    . wherever we have the euro sign . I thing to note here is while displaying the data i...

  • 527 Views
  • 0 replies
  • 1 kudos
Brahmareddy
by Honored Contributor III
  • 683 Views
  • 0 replies
  • 1 kudos

Use Query Patterns to Suggest Indexes Dynamically

Hey folks,Ever notice how a query that used to run super fast suddenly starts dragging? We’ve all been there. As data grows, those little inefficiencies in your SQL start showing up — and they show up hard. That’s where something cool comes in: using...

  • 683 Views
  • 0 replies
  • 1 kudos
Brahmareddy
by Honored Contributor III
  • 3663 Views
  • 6 replies
  • 4 kudos

My Journey with Schema Management in Databricks

When I first started handling schema management in Databricks, I realized that a little bit of planning could save me a lot of headaches down the road. Here’s what I’ve learned and some simple tips that helped me manage schema changes effectively. On...

  • 3663 Views
  • 6 replies
  • 4 kudos
Latest Reply
Brahmareddy
Honored Contributor III
  • 4 kudos

Haha, glad it made sense! Joao.Try it out, and if you run into any issues, just let me know. Always happy to help! And best friends? You got it!

  • 4 kudos
5 More Replies
DataDarvish
by New Contributor II
  • 575 Views
  • 0 replies
  • 1 kudos

Unit Testing for Data Engineering: How to Ensure Production-Ready Data Pipelines

In today’s data-driven world, the success of any business use case relies heavily on trust in the data. This trust is built upon key pillars such as data accuracy, consistency, freshness, and overall quality. When organizations release data into prod...

  • 575 Views
  • 0 replies
  • 1 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
OSZAR »