Databricks Platform Discussions
Hi folks, I was interested in doing the certification for the "Databricks Certified Data Analyst Associate" here: https://www.databricks.com/learn/certification/data-analyst-associate. Looking at the "Related Training" section, I see recommended training i...
Hello @rodneyc8063! Yes, all three courses cover the same content; the difference lies in the format and access:
- 2-hour Self-Paced: Free, video-only
- 3-hour Self-Paced: Paid, includes hands-on labs
- 4-hour Instructor-Led: Paid, includes labs and a live i...
I have a PySpark job reading an input volume of just ~50-55 GB of Parquet data from a Delta table on Databricks. The job uses n2-highmem-4 GCP VMs and 1-15 workers with autoscaling on Databricks. Each worker VM of type n2-highmem-4 has 32 GB of memory and...
Much appreciated, @mark_ott and @BigRoux, for the prompt response. The job uses the cluster/settings below. Cluster/Spark version - Driver: n2-highmem-4 · Workers: n2-highmem-4 · 5-15 workers · DBR: 15.4 LTS (includes Apache Spark 3.5.0, Scala 2.12) on GCP. P...
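Since the question itself is truncated, here is only a generic first-pass sketch for checking how the ~50 GB scan splits into tasks; the table name and the 128 MB cap are placeholders, not tuned recommendations:

# Check how many tasks the scan produces, then cap the bytes each task
# reads. Values and names here are illustrative placeholders.
df = spark.read.table("my_catalog.my_schema.my_delta_table")
print(df.rdd.getNumPartitions())

spark.conf.set("spark.sql.files.maxPartitionBytes", str(128 * 1024 * 1024))  # 128 MB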
It seems that, due to how Databricks processes SQL cells, it's impossible to escape the $ when it comes to a column name. I would expect the following to work:

%sql
SELECT 'hi' `$id`

The backticks ought to escape everything. And indeed that's exactly wha...
+1 here - hoping to hear any updates.
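One workaround that may help while waiting (a sketch, not an official fix): issue the statement through spark.sql() from a Python cell, where $ inside a backticked identifier is not treated as a parameter marker the way it is in a %sql cell:

# In a Python cell, no $-substitution happens before parsing, so the
# backticked identifier keeps its literal $ (the column name is illustrative).
spark.sql("SELECT 'hi' AS `$id`").show()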
Hello all, I am tasked with evaluating a new LLM for some use cases. In particular, I need to build a POC for a chatbot based on that model. To that end, I want to create a custom Serving Endpoint for an LLM pulled from Hugging Face. The model itself is...
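A rough sketch of one common starting point (hypothetical names throughout; assumes mlflow and transformers are installed on the cluster): log the Hugging Face pipeline with MLflow so it can be registered and then exposed via a Databricks Model Serving endpoint.

# Hypothetical sketch: the model id and registry name are placeholders,
# not from this thread. After registration, the registered model can be
# attached to a Model Serving endpoint from the Serving UI or API.
import mlflow
from transformers import pipeline

chat = pipeline("text-generation", model="some-org/some-llm")  # placeholder model

with mlflow.start_run():
    mlflow.transformers.log_model(
        transformers_model=chat,
        artifact_path="model",
        registered_model_name="my_llm_poc",  # hypothetical registry name
    )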
Hello guys! I am getting this error when running a job:

ERROR: Could not install packages due to an OSError: [Errno 13] Permission denied: '/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.11/site-packages/some-python-package'

I have lis...
Thanks for clarifying, Isi, really appreciate it.
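Since the suggestion itself is truncated above, one common way around this class of cluster-library permission error (an assumption about the root cause, not necessarily what Isi proposed) is a notebook-scoped install:

# Notebook-scoped install: writes into a per-notebook environment the user
# owns, avoiding writes to the shared cluster_libraries path.
%pip install some-python-package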
Hi, I am doing a data engineering course in Databricks (Partner labs) and would like to have access to the Vocareum workspace to practice using the demo sessions. Can you please help me get access to this workspace? Regards, Aravind
Can you please provide links? Screenshots? More info? This answer is not specific enough. I'm taking the Data Analysis learning path; there are different demos I'd like to practice, and there are no SP Lab environment links as mentioned in the videos.
We have a date-partitioned (DD/MM/YYYY) BQ table. We want to update a specific partition's data in 'overwrite' mode using PySpark. To do this, I set 'spark.sql.sources.partitionOverwriteMode' to 'DYNAMIC' as per the Spark BQ connector documentat...
@soumiknow, just checking if there are any further questions, and whether my last comment helped.
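For reference, a sketch of the pattern the question describes (option names per the spark-bigquery connector docs; the project/dataset/bucket values are placeholders — dynamic partition overwrite needs the indirect write method):

spark.conf.set("spark.sql.sources.partitionOverwriteMode", "DYNAMIC")

(df.write.format("bigquery")
    .option("table", "my_project.my_dataset.my_table")   # placeholder
    .option("temporaryGcsBucket", "my-temp-bucket")       # placeholder
    .option("writeMethod", "indirect")
    .mode("overwrite")
    .save())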
We're currently using Lakehouse Federation for various sources (Snowflake, SQL Server), usually successfully. However, we've encountered a case where one of the databases on the SQL Server has spaces in its name, e.g. 'My Database Name'. We've tried vari...
Hi @MaartenH, can you try creating the foreign catalog like this?

CREATE FOREIGN CATALOG your_catalog_name
USING CONNECTION your_connection_name
OPTIONS (database '[My Database Name]');

(Do check that the foreign catalog name must follow Unity Catalog...
Hi everyone, I am encountering a problem when using ipywidgets with plotly on Databricks. I am trying to pass interactive arguments to a function and then plot with plotly. When I do the following:

def f(m, b):
    plt.figure(2)
    x = np.linspace(-10,...
Thanks for the suggestion! You're absolutely right. The code was already all in my message, but I can make it easier to copy-paste (and add the imports):

from ipywidgets import interactive
import matplotlib.pyplot as plt
import numpy as np

def f(m, b...
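The snippet being rebuilt above matches the standard interactive-slider demo from the ipywidgets documentation; a runnable reconstruction follows, where the linspace bounds and slider ranges are assumptions since both posts are truncated:

from ipywidgets import interactive
import matplotlib.pyplot as plt
import numpy as np

def f(m, b):
    # Draw the line y = m*x + b on a fixed axis range.
    plt.figure(2)
    x = np.linspace(-10, 10, num=1000)
    plt.plot(x, m * x + b)
    plt.ylim(-5, 5)
    plt.show()

# Sliders for slope m and intercept b; the ranges are illustrative.
interactive_plot = interactive(f, m=(-2.0, 2.0), b=(-3.0, 3.0, 0.5))
interactive_plot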
Hello, I have a daily ETL job that adds new records to a table for the previous day. However, from time to time it does not produce any output. After investigating, I discovered that one table is sometimes loaded as empty during execution. As a resul...
Thank you very much, @BigRoux, for such a detailed and insightful answer! All tables used in this processing are managed Delta tables loaded through Unity Catalog. I will try running it with spark.databricks.io.cache.enabled set to false just to see i...
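For anyone wanting to run the same experiment, the setting mentioned can be flipped per session (a sketch; this disables the Databricks disk/IO cache, not Spark's own DataFrame caching):

# Turn off the disk (IO) cache for this session, to rule out stale
# cached Parquet files as the cause of the intermittently empty reads.
spark.conf.set("spark.databricks.io.cache.enabled", "false")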
Is it possible to show the full logs of a Databricks job? Currently, the logs are cut off with:

*** WARNING: max output size exceeded, skipping output. ***

However, I don't believe our log files are more than 20 MB. I know you can press the logs button...
Hey @Kaz, unfortunately the output truncation limit in the Databricks job UI cannot be changed. Once that limit is exceeded, the rest of the logs are skipped, and the full logs become accessible only through the “Logs” button, which, as you mentione...
Hello, after scouring documentation yesterday, I was finally able to get Unity Catalog enabled and assigned to my workspace. Or so I thought. When I run the CURRENT_METASTORE() command I get the below error. However, when I look at my catalog I can see...
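For anyone reproducing the check, the command under discussion can be run from a notebook attached to a UC-enabled cluster (a minimal sketch):

# current_metastore() returns the assigned metastore id; an error here
# usually means the cluster or workspace is not actually UC-enabled.
spark.sql("SELECT current_metastore()").show(truncate=False)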
I use badRecordsPath while reading a CSV as follows:

df = (spark.read.format("csv")
      .schema(schema)
      .option("badRecordsPath", bad_records_path))

Since bad records are not written immediately, I want to know how I can trigger the write...
Hey @bjn,

1) Yes, if you run both df.write.format("noop")... and df.write.format("delta").saveAsTable(...), you’re triggering two separate actions, and Spark will evaluate the DataFrame twice. That includes parsing the CSV and, importantly, processin...
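A sketch of the point being made (schema and paths are the placeholders from the question): a "noop" write is a real action, so it forces the full CSV scan, and the badRecordsPath files get written, without producing any actual output.

df = (spark.read.format("csv")
      .schema(schema)
      .option("badRecordsPath", bad_records_path)
      .load(input_path))

# No-op sink: evaluates the whole DataFrame but writes nothing.
df.write.format("noop").mode("overwrite").save()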
I am trying to execute a pandas UDF in Databricks. It gives me the following error on serverless compute:

File /local_disk0/.ephemeral_nfs/envs/pythonEnv-b11ff17c-9b25-4ccb-927d-06a7d1ca7221/lib/python3.11/site-packages/pyspark/sql/connect/client/core.p...
Serverless is management-free, which means you cannot choose the image. Hope this helps. Lou.
How to get the Databricks host name from a trial account?
Hi @harshajain, follow the steps below:
1. Log in to the Databricks trial portal.
2. Access the workspace provided upon login.
3. The hostname is part of the URL. For example, if the URL is https://trial-1234568.cloud.databricks.com, the hostname is trial-1234568.cloud.databricks.com.