有人试过吗 spark-solr
https://github.com/lucidworks/spark-solr
当我运行这个
val options = Map("collection" -> "DS", "zkhost" -> "url:2181,url:2181,url:2181")
val df = spark.read.format("solr").options(options).load
然后我得到这个错误
at solr.DefaultSource.createRelation(DefaultSource.scala:14)
at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:317)
at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:223)
at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:211)
at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:167)
... 49 elided
Caused by: java.lang.ClassNotFoundException: org.apache.solr.client.solrj.io.stream.expr.StreamExpressionParameter
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
... 54 more
我的安装程序是Ubuntu18,Solr7.4,Spark2.4.0,Scala2.11.12
1条答案
按热度按时间ou6hu8tu1#
这个
solrj
项目中缺少lib。添加
solrj
lib as nit应该适合你。