org.apache.spark.SparkException: Job aborted due to stage failure: Task 4 in stage 24.0 failed 1 times, most recent failure: Lost task 4.0 in stage 24.0 (TID 76, localhost): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "/databricks/spark/python/pyspark/worker.py", line 111, in main process() File "/databricks/spark/python/pyspark/worker.py", line 106, in process serializer.dump_stream(func(split_index, iterator), outfile) File "/databricks/spark/python/pyspark/rdd.py", line 2346, in pipeline_func return func(split, prev_func(split, iterator)) File "/databricks/spark/python/pyspark/rdd.py", line 2346, in pipeline_func return func(split, prev_func(split, iterator)) File "/databricks/spark/python/pyspark/rdd.py", line 317, in func return f(iterator) File "/databricks/spark/python/pyspark/rdd.py", line 1776, in combineLocally merger.mergeValues(iterator) File "/databricks/spark/python/pyspark/shuffle.py", line 236, in mergeValues for k, v in iterator: File "<ipython-input-31-4b09041aa30b>", line 1, in <lambda> File "<ipython-input-28-f43debc22073>", line 3, in parseNeighbors File "/databricks/python/lib/python2.7/re.py", line 171, in split return _compile(pattern, flags).split(string, maxsplit) TypeError: expected string or buffer

Searched on Google with the first line of a JAVA stack trace?

We can recommend more relevant solutions and speed up debugging when you paste your entire stack trace with the exception message. Try a sample exception.

Recommended solutions based on your search

Samebug tips

Do you know how to solve this issue? Write a tip to help other users and build your expert profile.

Solutions on the web

via Stack Overflow by John Constantine
, 1 year ago
Job aborted due to stage failure: Task 4 in stage 24.0 failed 1 times, most recent failure: Lost task 4.0 in stage 24.0 (TID 76, localhost): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "/databricks/spark
via Stack Overflow by Emdadul
, 1 year ago
Job aborted due to stage failure: Task 0 in stage 46.0 failed 1 times, most recent failure: Lost task 0.0 in stage 46.0 (TID 63, localhost): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "C:\Spark\python\lib
via Stack Overflow by Algina
, 11 months ago
Job aborted due to stage failure: Task 0 in stage 0.0 failed 1 times, most recent failure: Lost task 0.0 in stage 0.0 (TID 0, localhost): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "/home/alg/programs
via Stack Overflow by Edamame
, 1 year ago
Job aborted due to stage failure: Task 0 in stage 30.0 failed 4 times, most recent failure: Lost task 0.3 in stage 30.0 (TID 52, ph-hdp-prd-dn02): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "/data/0/yarn
via Stack Overflow by ThatDataGuy
, 1 year ago
Job aborted due to stage failure: Task 14 in stage 50.0 failed 4 times, most recent failure: Lost task 14.3 in stage 50.0 (TID 7583, brllxhtce01.bluecrest.local): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File
via Stack Overflow by TheStupidOne
, 1 year ago
Job aborted due to stage failure: Task 93 in stage 6.0 failed 4 times, most recent failure: Lost task 93.3 in stage 6.0 (TID 172, test-138): org.apaark.api.python.PythonException: Traceback (most recent call last): File "/usr/local/spark/spark
org.apache.spark.SparkException: Job aborted due to stage failure: Task 4 in stage 24.0 failed 1 times, most recent failure: Lost task 4.0 in stage 24.0 (TID 76, localhost): org.apache.spark.api.python.PythonException: Traceback (most recent call last): File "/databricks/spark/python/pyspark/worker.py", line 111, in main process() File "/databricks/spark/python/pyspark/worker.py", line 106, in process serializer.dump_stream(func(split_index, iterator), outfile) File "/databricks/spark/python/pyspark/rdd.py", line 2346, in pipeline_func return func(split, prev_func(split, iterator)) File "/databricks/spark/python/pyspark/rdd.py", line 2346, in pipeline_func return func(split, prev_func(split, iterator)) File "/databricks/spark/python/pyspark/rdd.py", line 317, in func return f(iterator) File "/databricks/spark/python/pyspark/rdd.py", line 1776, in combineLocally merger.mergeValues(iterator) File "/databricks/spark/python/pyspark/shuffle.py", line 236, in mergeValues for k, v in iterator: File "<ipython-input-31-4b09041aa30b>", line 1, in <lambda> File "<ipython-input-28-f43debc22073>", line 3, in parseNeighbors File "/databricks/python/lib/python2.7/re.py", line 171, in split return _compile(pattern, flags).split(string, maxsplit) TypeError: expected string or buffer
at org.apache.spark.api.python.PythonRunner$$anon$1.read(PythonRDD.scala:166)
at org.apache.spark.api.python.PythonRunner$$anon$1.(PythonRDD.scala:207)
at org.apache.spark.api.python.PythonRunner.compute(PythonRDD.scala:125)
at org.apache.spark.api.python.PythonRDD.compute(PythonRDD.scala:70)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:270)
at org.apache.spark.api.python.PairwiseRDD.compute(PythonRDD.scala:342)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:270)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:79)
at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:46)
at org.apache.spark.scheduler.Task.run(Task.scala:96)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

Users with the same issue

2 times, 1 year ago
Samebug visitor profile picture
Unknown user
Once, 1 year ago
Samebug visitor profile picture
Unknown user
Once, 1 year ago
Samebug visitor profile picture
Unknown user
Once, 1 year ago
Samebug visitor profile picture
Unknown user
Once, 1 year ago

Write tip

Know the solutions? Share your knowledge to help other developers to debug faster.