object SparkHadoopUtil in package deploy cannot be accessed in package org.apache.spark.deploy

Problem description

Why can't SparkHadoopUtil be accessed here, even though it is imported, when it could be accessed in earlier versions of Spark?

Welcome to
      ____              __
     / __/__  ___ _____/ /__
    _\ \/ _ \/ _ `/ __/  '_/
   /___/ .__/\_,_/_/ /_/\_\   version 3.0.2
      /_/
         
Using Scala version 2.12.10 (OpenJDK 64-Bit Server VM, Java 1.8.0_282)
Type in expressions to have them evaluated.
Type :help for more information.

scala> import org.apache.spark.deploy.SparkHadoopUtil
import org.apache.spark.deploy.SparkHadoopUtil

scala> import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.conf.Configuration

scala> 

scala> 

scala>  val hadoopConf: Configuration = SparkHadoopUtil.get.conf
<console>:25: error: object SparkHadoopUtil in package deploy cannot be accessed in package org.apache.spark.deploy
        val hadoopConf: Configuration = SparkHadoopUtil.get.conf
                                        ^

scala> 

Solution

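That is because the SparkHadoopUtil class is no longer public in Spark 3. It is now declared private[spark], so only code inside the org.apache.spark package can reference it. The import itself still succeeds, and the error only appears when the object is actually used. Here is the difference between the Spark 2.4 and Spark 3.0 sources.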

Spark 2.4:

@DeveloperApi
class SparkHadoopUtil extends Logging {

Spark 3.0:

private[spark] class SparkHadoopUtil extends Logging {
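
One possible workaround, not part of the original answer, is to avoid SparkHadoopUtil and read the Hadoop configuration from the public SparkContext.hadoopConfiguration accessor. The sketch below assumes a local SparkSession is acceptable. The object name HadoopConfExample, the helper hadoopConfOf, and the app name are made up for illustration. The resulting Configuration should be close to what SparkHadoopUtil.get.conf returned in 2.4, but that equivalence is an assumption worth checking for your settings.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.spark.sql.SparkSession

// Hypothetical example object; the names here are illustrative only.
object HadoopConfExample {

  // Reads the Hadoop Configuration through the public SparkContext API
  // instead of the private[spark] SparkHadoopUtil.
  def hadoopConfOf(spark: SparkSession): Configuration =
    spark.sparkContext.hadoopConfiguration

  def main(args: Array[String]): Unit = {
    // Assumption: a local master is fine for this demonstration.
    val spark = SparkSession.builder()
      .appName("hadoop-conf-example")
      .master("local[*]")
      .getOrCreate()

    val hadoopConf: Configuration = hadoopConfOf(spark)

    // Print one well-known Hadoop key to show the object is usable.
    println(hadoopConf.get("fs.defaultFS"))

    spark.stop()
  }
}
```

In spark-shell, where spark is already defined, the same thing reduces to val hadoopConf: Configuration = spark.sparkContext.hadoopConfiguration.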