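PySpark issue in Jupyter Notebook

Problem description

I get this error during initialization. I have set up the master and the worker and started them. However, after setting up the master and Spark, I did not launch spark-shell to run it directly. Can anyone help?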

Code

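import findspark
import os
os.environ["SPARK_HOME"] = r"C:\Spark"  # raw string: the backslash is not treated as an escape
findspark.init()  # reads SPARK_HOME and adds Spark's Python libraries to sys.path
from pyspark.sql import SparkSession
from pyspark import SparkConf, SparkContext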

Error

IndexError                                Traceback (most recent call last)
c:\users\khan\appdata\local\programs\python\python37\lib\site-packages\findspark.py in init(spark_home, python_path, edit_rc, edit_profile)
    142     try:
--> 143         py4j = glob(os.path.join(spark_python, "lib", "py4j-*.zip"))[0]
    144     except IndexError:

IndexError: list index out of range

During handling of the above exception, another exception occurred:

Exception                                 Traceback (most recent call last)
<ipython-input-7-db62de47bcf3> in <module>
      2 import os
      3 os.environ["SPARK_HOME"]="C:\Spark"
----> 4 findspark.init()
      5 from pyspark.sql import SparkSession
      6 from pyspark import SparkConf, SparkContext

c:\users\khan\appdata\local\programs\python\python37\lib\site-packages\findspark.py in init(spark_home, python_path, edit_rc, edit_profile)
    144     except IndexError:
    145         raise Exception(
--> 146             "Unable to find py4j, your SPARK_HOME may not be configured correctly"
    147         )
    148     sys.path[:0] = [spark_python, py4j]

Exception: Unable to find py4j, your SPARK_HOME may not be configured correctly
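
The traceback shows that findspark failed while globbing for a py4j-*.zip file in the lib folder of its Spark Python directory, so the list it tried to index was empty. Below is a minimal sketch for checking whether a candidate SPARK_HOME contains that archive. It assumes the Spark Python directory is <SPARK_HOME>/python, which is the usual layout of an extracted Spark binary distribution. The helper names find_py4j_zips and diagnose_spark_home are hypothetical and are not part of findspark.

```python
import glob
import os


def find_py4j_zips(spark_home):
    """Return the py4j-*.zip archives found under <spark_home>/python/lib."""
    # Assumption: Spark's Python directory is <spark_home>/python.
    pattern = os.path.join(spark_home, "python", "lib", "py4j-*.zip")
    return sorted(glob.glob(pattern))


def diagnose_spark_home(spark_home):
    """Return a short verdict on whether spark_home looks usable by findspark."""
    if not os.path.isdir(spark_home):
        return f"{spark_home!r} is not a directory"
    if not find_py4j_zips(spark_home):
        lib_dir = os.path.join(spark_home, "python", "lib")
        return f"no py4j-*.zip under {lib_dir!r}"
    return "ok"


if __name__ == "__main__":
    # C:\Spark is the value used in the question; the environment variable wins if set.
    candidate = os.environ.get("SPARK_HOME", r"C:\Spark")
    print(diagnose_spark_home(candidate))
```

If the check reports that no archive was found, one possible cause is that SPARK_HOME points at a parent folder rather than at the extracted distribution folder itself. That is an assumption about this setup, not something the question confirms.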

Solution

No confirmed solution to this problem has been found yet.
