cancel
Showing results for 
Search instead for 
Did you mean: 
Resources
Explore a comprehensive repository of resources on the Databricks Community. Access tutorials, guides, webinars, and more to enhance your skills in data analytics and machine learning.
cancel
Showing results for 
Search instead for 
Did you mean: 

Browse the Community

Get Started Resources

Explore essential resources to kickstart your journey with Databricks. Access tutorials, guides, and...

0 Posts

Events

Stay updated on Databricks events, including webinars, conferences, and workshops. Discover opportun...

119 Posts

Support FAQs

Find answers to common questions and troubleshoot issues with Databricks support FAQs. Access helpfu...

19 Posts

Technical Blog

Explore in-depth articles, tutorials, and insights on data analytics and machine learning in the Dat...

217 Posts

Knowledge Sharing Hub

Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practi...

139 Posts

Announcements

Stay up-to-date with the latest announcements from Databricks. Learn about product updates, new feat...

113 Posts

DatabricksTV

Community-produced videos to help you leverage Databricks in your Data & AI journey. Tune in to expl...

137 Posts

Activity in Resources

dineshvk
by Databricks Employee
  • 49 Views
  • 0 replies
  • 3 kudos

Best Practices for Migrating Spark ETL Workloads to Databricks

Best Practices for Migrating Spark ETL Workloads to Databricks Introduction Migrating Spark ETL workloads to Databricks unlocks faster performance, lower costs, and enhanced scalability. With built-in support for Delta Lake, automated cluster managem...

dineshvk_0-1746626931903.png Screenshot 2024-12-12 at 12.35.40 PM.png
  • 49 Views
  • 0 replies
  • 3 kudos
Malcoln
by Databricks Employee
  • 310 Views
  • 2 replies
  • 5 kudos

How to Use the Transform Pattern in PySpark for Modular and Maintainable ETL

Data engineers, scientists, and analysts working with big data often rely on PySpark to build ETL (Extract-Transform-Load) pipelines. These pipelines typically involve a series of transformations applied to raw data to clean, enrich, and prepare it f...

logo_blog.png
  • 310 Views
  • 2 replies
  • 5 kudos
Latest Reply
MatheusMalta
  • 5 kudos

Congratulations on the publication, it helped me a lot, I didn't know I could return a dataframe in a function.It is certainly excellent for code reuse.

  • 5 kudos
1 More Replies
brett-aulbaugh
by Databricks Employee
  • 95 Views
  • 0 replies
  • 1 kudos

5 ways to leverage Databricks Assistant to go from Petroleum Engineer to Data Scientist

5 ways to leverage Databricks Assistant to go from Petroleum Engineer to Data Scientist In today's rapidly evolving energy landscape, petroleum engineers face unprecedented challenges that extend far beyond traditional reservoir management. While man...

brettaulbaugh_0-1745580184697.png brettaulbaugh_1-1745580184737.png brettaulbaugh_2-1745580184588.png brettaulbaugh_3-1745580184546.png
  • 95 Views
  • 0 replies
  • 1 kudos
Sujitha
by Databricks Employee
  • 58 Views
  • 0 replies
  • 2 kudos

Announcing updates to Databricks Query Profiles

At Databricks, we’re always working hard to make your queries run faster. Still, there are times when it is helpful to look a little deeper to see how your queries are turned into execution plans and distributed for parallel execution. That’s where Q...

Screenshot 2025-05-07 at 3.56.53 PM.png
  • 58 Views
  • 0 replies
  • 2 kudos
Sujitha
by Databricks Employee
  • 58 Views
  • 0 replies
  • 1 kudos

[eBook] Migrate your legacy data warehouse to Databricks

Low-risk migration to a fast and open data warehouse Legacy enterprise data warehouses (EDWs) are becoming a bottleneck for businesses aiming to scale operations and adopt advanced analytics. Traditional EDWs struggle with: Scalability: Expensive har...

Screenshot 2025-05-07 at 3.53.14 PM.png
  • 58 Views
  • 0 replies
  • 1 kudos
Sujitha
by Databricks Employee
  • 71 Views
  • 0 replies
  • 0 kudos

Upskill yourself and your teams at Data+AI Summit

Get 20% off paid training today using the code: TRNGDQ9TY. We are experiencing an unprecedented pace of technological innovation driven by AI and data. According to the World Economic Forum’s Future of Jobs report, six out of ten business leaders exp...

Screenshot 2025-05-07 at 3.35.22 PM.png
  • 71 Views
  • 0 replies
  • 0 kudos
MichTalebzadeh
by > Valued Contributor
  • 1371 Views
  • 3 replies
  • 0 kudos

Financial Crime detection with the help of Apache Spark, Data Mesh and Data Lake

For those interested in Data Mesh and Data Lakes for FinCrime detection:Data mesh is a relatively new architectural concept for data management that emphasizes domain-driven data ownership and self-service data availability. It promotes the decentral...

Knowledge Sharing Hub
data lakes
Data Mesh
financial crime
spark
  • 1371 Views
  • 3 replies
  • 0 kudos
Latest Reply
carrolbeau
Visitor
  • 0 kudos

It's great that you're focusing on financial crime detection with advanced technologies like Apache Spark, Data Mesh, and Data Lake. For those looking to dive deeper into criminal records and related data, tools like KY criminal lookup can provide es...

  • 0 kudos
2 More Replies
amcclendon
by New Contributor II
  • 122 Views
  • 0 replies
  • 0 kudos

Managing LLM Pretraining data using Mosaic Data Sharding

Handling large-scale datasets efficiently is one of the biggest bottlenecks in modern machine learning workflows and when pre-training LLMs. As datasets grow in size and complexity, traditional methods like memmap arrays or PyTorch DataLoaders can st...

amcclendon_1-1745937199262.png amcclendon_2-1745937199263.png amcclendon_3-1745937199265.png amcclendon_4-1745937199269.png
  • 122 Views
  • 0 replies
  • 0 kudos
stevejohansen
by Databricks Employee
  • 135 Views
  • 0 replies
  • 2 kudos

Unity Catalog with automatic enablement (Part 2 - AWS)

IntroductionWhat is a Workspace Catalog?How Automatic Workspace Assignment WorksAWS infrastructure deployed during automatic enablementSystem-owned groups and permissionsMetastore-level grants for Auto-Enabled Workspace AdministratorsBest practices f...

uc-by-default-aws-overview.png uc-by-default-flow-aws.png uc-by-default-aws-metastore-tick.png uc-by-default-aws-infrapng.png
  • 135 Views
  • 0 replies
  • 2 kudos
stevejohansen
by Databricks Employee
  • 165 Views
  • 0 replies
  • 3 kudos

Unity Catalog with automatic enablement (Part 1 - Azure)

IntroductionWhat is a Workspace Catalog?How Automatic Workspace Assignment WorksAzure infrastructure deployed during automatic enablementSystem-owned groups and permissionsMetastore-level grants for Auto-Enabled Workspace AdministratorsBest practices...

uc-by-default-securables-azure.png uc-by-default-flow-azure.png uc-by-default-azure-no-metastore.png uc-by-default-azure-infra.png
  • 165 Views
  • 0 replies
  • 3 kudos
josh_melton
by Databricks Employee
  • 63 Views
  • 0 replies
  • 0 kudos

Exploring Code With Databricks Apps

Introduction In this article, we’ll walk through how to develop, configure, and deploy a Databricks App for understanding complex code bases. While there’s a lot of freely available, world class software to explore in open source, it’s challenging to...

Screenshot 2025-04-23 at 9.27.04 AM.png
  • 63 Views
  • 0 replies
  • 0 kudos
greg-hansen
by Databricks Employee
  • 93 Views
  • 0 replies
  • 1 kudos

Monitoring Structured Streaming in Production with StreamingQueryListener

Intro Databricks customers use Structured Streaming to drive critical business functions like equipment monitoring, fraud detection, and inventory management. Reliability is a key design factor for these workloads. Engineers design streaming jobs for...

greghansen_0-1746105345098.png greghansen_1-1746105345205.png greghansen_2-1746105345242.png greghansen_3-1746105345235.png
  • 93 Views
  • 0 replies
  • 1 kudos
NikithaTirumala
by New Contributor II
  • 75 Views
  • 0 replies
  • 0 kudos

[Partner Blog] Using GenAI on Databricks to Implement Master Data Management (MDM)

Summary Learn how Lakefusion modernizes MDM using GenAI on Databricks.Discover real-world use cases of AI-driven master stand the architectural synergy between Lakehouse and Lakefusion. Introduction: The Challenge of MDM in the Modern Data Landscape...

visitin_card_side_1 (1).png Provider 360 Architecture Diagram (4).png visitin_card_side_1 (1).png
  • 75 Views
  • 0 replies
  • 0 kudos
ThomazRossito
by > Contributor
  • 2252 Views
  • 1 replies
  • 1 kudos

Post: Lakehouse Federation - Databricks

Lakehouse Federation - Databricks In the world of data, innovation is constant. And the most recent revolution comes with Lakehouse Federation, a fusion between data lakes and data warehouses, taking data manipulation to a new level. This advancement...

Knowledge Sharing Hub
data engineer
Lakehouse
SQL Analytics
  • 2252 Views
  • 1 replies
  • 1 kudos
Latest Reply
Freshman
New Contributor III
  • 1 kudos

Hey Quick Question, Can we use it for the production version ? We have application server as SQL server, we are planning to use lakehouse federation so we can bypass creating and maintaining 100 of workflows. as we a small dataset I am not too sure o...

  • 1 kudos
RamGoli
by Databricks Employee
  • 443 Views
  • 4 replies
  • 5 kudos

Fine-Grained Data Security with Column Mask and Row Level Security in DLT

Overview of Data Security: Data security is a cornerstone of any modern analytics or data processing environment, ensuring that sensitive information remains protected throughout its lifecycle. Organizations must comply with regulations like GDPR, HI...

RamGoli_0-1746106283806.png RamGoli_1-1746106493401.png RamGoli_2-1746105883394.png RamGoli_0-1746107190413.png
  • 443 Views
  • 4 replies
  • 5 kudos
Latest Reply
RamGoli
Databricks Employee
  • 5 kudos

I am looking forward to ABAC as well! 

  • 5 kudos
3 More Replies
Top Kudoed Authors
OSZAR »