kafka-connect 文件脉冲连接器独立无法启动(在偏移管理器上找不到类异常) [英] kafka-connect file-pulse connector standalone could not start(class not found exception on offset manager)
问题描述
我想使用 filepulse 连接器将 xml 文件加载到 kafka.
以下是我的环境:
- Win10 WSL,安装了 Ubuntu
- 下载融合平台 5.5.1(参见https://www.confluent.io/download/"),解压
- 从 github (https://github.com/streamthoughts/kafka-connect-file-pulse/releases),解压
- 修改了connect-standalone.properties";位于融合路径(etc/kafka/connect-standalone.properties)下以包含路径/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib"
- 我没有创建任何主题.
我启动了zookeeper;kafka,并尝试启动 kafka-connect 独立,如下所示:
$ zookeeper-server-start etc/kafka/zookeeper.properties$ kafka-server-start etc/kafka/server.properties$ 连接独立 \etc/kafka/connect-standalone.properties \/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/etc/quickstart-connect-file-pulse-csv.properties
但我有失败见下文
[2020-09-08 15:57:45,522] INFO 从以下位置加载插件:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-脉冲表达式-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)[2020-09-08 15:57:45,541] INFO 注册加载器:PluginClassLoader{pluginLocation=file:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-expression-1.5.2.jar} (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:262)[2020-09-08 15:57:45,541] 信息加载插件来自:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-filters-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)[2020-09-08 15:57:45,553] INFO 注册加载器:PluginClassLoader{pluginLocation=file:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-filters-1.5.2.jar} (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:262)[2020-09-08 15:57:45,554] 信息加载插件来自:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-plugin-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)[2020-09-08 15:57:45,575] 错误由于错误而停止 (org.apache.kafka.connect.cli.ConnectStandalone:130)java.lang.NoClassDefFoundError: io/streamthoughts/kafka/connect/filepulse/offset/OffsetManager在 java.lang.Class.getDeclaredConstructors0(Native Method)在 java.lang.Class.privateGetDeclaredConstructors(Class.java:2671)在 java.lang.Class.getConstructor0(Class.java:3075)在 java.lang.Class.newInstance(Class.java:412)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.versionFor(DelegatingClassLoader.java:385)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.getPluginDesc(DelegatingClassLoader.java:355)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.scanPluginPath(DelegatingClassLoader.java:328)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.scanUrlsAndAddPlugins(DelegatingClassLoader.java:261)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.registerPlugin(DelegatingClassLoader.java:253)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.initPluginLoader(DelegatingClassLoader.java:222)在 org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.initLoaders(DelegatingClassLoader.java:199)在 org.apache.kafka.connect.runtime.isolation.Plugins.(Plugins.java:60)在 org.apache.kafka.connect.cli.ConnectStandalone.main(ConnectStandalone.java:79)引起:java.lang.ClassNotFoundException:io.streamthoughts.kafka.connect.filepulse.offset.OffsetManager在 java.net.URLClassLoader.findClass(URLClassLoader.java:382)在 java.lang.ClassLoader.loadClass(ClassLoader.java:418)在 org.apache.kafka.connect.runtime.isolation.PluginClassLoader.loadClass(PluginClassLoader.java:104)在 java.lang.ClassLoader.loadClass(ClassLoader.java:351)……还有 13 个
问题你能帮忙解决什么问题吗?
ps.下面是我对 jdbc 连接器的尝试,只是为了看看其他连接器是否正常工作.它没有有例外.
connect-standalone \etc/kafka/connect-standalone.properties \/home/min/confluent-5.5.1/etc/kafka-connect-jdbc/sink-quickstart-sqlite.properties
pps.下面是file-pulse connector的内容
(我没做改动)
connector.class=io.streamthoughts.kafka.connect.filepulse.source.FilePulseSourceConnector主题=连接文件脉冲快速启动csv任务.max=1过滤器 = ParseDelimitedRow# 分隔行过滤器filters.ParseDelimitedRow.extractColumnName=headersfilters.ParseDelimitedRow.trimColumn=truefilters.ParseDelimitedRow.type=io.streamthoughts.kafka.connect.filepulse.filter.DelimitedRowFilter跳过.headers=1task.reader.class=io.streamthoughts.kafka.connect.filepulse.reader.RowFileInputReader#文件扫描fs.cleanup.policy.class=io.streamthoughts.kafka.connect.filepulse.clean.LogCleanupPolicyfs.scanner.class=io.streamthoughts.kafka.connect.filepulse.scanner.local.LocalFSDirectoryWalkerfs.scan.directory.path=/tmp/kafka-connect/examples/fs.scan.interval.ms=10000# 内部报告internal.kafka.reporter.bootstrap.servers=localhost:9092internal.kafka.reporter.id=connect-file-pulse-quickstart-csvinternal.kafka.reporter.topic=connect-file-pulse-status# 按名称和哈希跟踪文件offset.strategy=name+hash
pps.以下是 connect-standalone.properties
文件的关键信息(我添加/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib
)
我不认为这是一个完美的答案,但这正是我让它发挥作用的方式.
我发现'file-pulse'连接器的plugin.path
属性必须是jar文件的父文件夹.
所以下面有效.
$ cat etc/kafka/connect-standalone.properties
提取的关键信息:
plugin.path=/user/share/java,/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2# plugin.path=/usr/share/java,/home/min/confluent-5.5.1/share/java/kafka-connect-jdbc
在它像下面这样之前,并像我提到的那样抛出异常.
plugin.path=/user/share/java,/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib# plugin.path=/usr/share/java,/home/min/confluent-5.5.1/share/java/kafka-connect-jdbc
也许我不理解 connect-standalone.properties
属性文件中的以下说明,或者文件脉冲连接器没有遵循此处所述的标准/说明.当然,JDBC 连接器的路径是包含 JAR 文件的路径.
# a) 直接包含带有插件及其依赖项的 jar 的目录# b) 带有插件及其依赖项的 uber-jars# c) 直接包含插件类及其依赖项的包目录结构的目录
I want to use filepulse connector to load xml files to kafka.
below are my environment:
- Win10 WSL, installed Ubuntu
- downloaded the confluent platform 5.5.1 (see "https://www.confluent.io/download/"), unpacked
- downloaded zip file version 1.5.2 from github (https://github.com/streamthoughts/kafka-connect-file-pulse/releases), unzipped
- modified the "connect-standalone.properties" located under confluent path (etc/kafka/connect-standalone.properties) to include path "/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib"
- I did not create any topic.
I started zookeeper; kafka, and tried to start kafka-connect standalone like below:
$ zookeeper-server-start etc/kafka/zookeeper.properties
$ kafka-server-start etc/kafka/server.properties
$ connect-standalone \
etc/kafka/connect-standalone.properties \
/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/etc/quickstart-connect-file-pulse-csv.properties
but I have failures see below
[2020-09-08 15:57:45,522] INFO Loading plugin from: /home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-expression-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)
[2020-09-08 15:57:45,541] INFO Registered loader: PluginClassLoader{pluginLocation=file:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-expression-1.5.2.jar} (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:262)
[2020-09-08 15:57:45,541] INFO Loading plugin from: /home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-filters-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)
[2020-09-08 15:57:45,553] INFO Registered loader: PluginClassLoader{pluginLocation=file:/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-filters-1.5.2.jar} (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:262)
[2020-09-08 15:57:45,554] INFO Loading plugin from: /home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib/kafka-connect-file-pulse-plugin-1.5.2.jar (org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader:239)
[2020-09-08 15:57:45,575] ERROR Stopping due to error (org.apache.kafka.connect.cli.ConnectStandalone:130)
java.lang.NoClassDefFoundError: io/streamthoughts/kafka/connect/filepulse/offset/OffsetManager
at java.lang.Class.getDeclaredConstructors0(Native Method)
at java.lang.Class.privateGetDeclaredConstructors(Class.java:2671)
at java.lang.Class.getConstructor0(Class.java:3075)
at java.lang.Class.newInstance(Class.java:412)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.versionFor(DelegatingClassLoader.java:385)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.getPluginDesc(DelegatingClassLoader.java:355)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.scanPluginPath(DelegatingClassLoader.java:328)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.scanUrlsAndAddPlugins(DelegatingClassLoader.java:261)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.registerPlugin(DelegatingClassLoader.java:253)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.initPluginLoader(DelegatingClassLoader.java:222)
at org.apache.kafka.connect.runtime.isolation.DelegatingClassLoader.initLoaders(DelegatingClassLoader.java:199)
at org.apache.kafka.connect.runtime.isolation.Plugins.<init>(Plugins.java:60)
at org.apache.kafka.connect.cli.ConnectStandalone.main(ConnectStandalone.java:79)
Caused by: java.lang.ClassNotFoundException: io.streamthoughts.kafka.connect.filepulse.offset.OffsetManager
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
at org.apache.kafka.connect.runtime.isolation.PluginClassLoader.loadClass(PluginClassLoader.java:104)
at java.lang.ClassLoader.loadClass(ClassLoader.java:351)
... 13 more
question Can you please help what could be the fix?
ps. below is what I tried for jdbc connector, just to see whether other connectors are working or not. It did not have exceptions.
connect-standalone \
etc/kafka/connect-standalone.properties \
/home/min/confluent-5.5.1/etc/kafka-connect-jdbc/sink-quickstart-sqlite.properties
pps. below is the content of file-pulse connector
(I did not make changes)
connector.class=io.streamthoughts.kafka.connect.filepulse.source.FilePulseSourceConnector
topic=connect-file-pulse-quickstart-csv
tasks.max=1
filters=ParseDelimitedRow
# Delimited Row filter
filters.ParseDelimitedRow.extractColumnName=headers
filters.ParseDelimitedRow.trimColumn=true
filters.ParseDelimitedRow.type=io.streamthoughts.kafka.connect.filepulse.filter.DelimitedRowFilter
skip.headers=1
task.reader.class=io.streamthoughts.kafka.connect.filepulse.reader.RowFileInputReader
# File scanning
fs.cleanup.policy.class=io.streamthoughts.kafka.connect.filepulse.clean.LogCleanupPolicy
fs.scanner.class=io.streamthoughts.kafka.connect.filepulse.scanner.local.LocalFSDirectoryWalker
fs.scan.directory.path=/tmp/kafka-connect/examples/
fs.scan.interval.ms=10000
# Internal Reporting
internal.kafka.reporter.bootstrap.servers=localhost:9092
internal.kafka.reporter.id=connect-file-pulse-quickstart-csv
internal.kafka.reporter.topic=connect-file-pulse-status
# Track file by name and hash
offset.strategy=name+hash
ppps. below are the key info for the connect-standalone.properties
file (I added the /home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib
)
# Set to a list of filesystem paths separated by commas (,) to enable class loading isolation for plugins
# (connectors, converters, transformations). The list should consist of top level directories that include
# any combination of:
# a) directories immediately containing jars with plugins and their dependencies
# b) uber-jars with plugins and their dependencies
# c) directories immediately containing the package directory structure of classes of plugins and their dependencies
# Note: symlinks will be followed to discover dependencies or plugins.
# Examples:
# plugin.path=/usr/local/share/java,/usr/local/share/kafka/plugins,/opt/connectors,
plugin.path=/usr/share/java,/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib
# plugin.path=/usr/share/java,/home/min/confluent-5.5.1/share/java/kafka-connect-jdbc
I would not call this one a perfect answer, but it is just the way I made it working.
I found the plugin.path
property for the 'file-pulse' connector has to be the parent folder of the jar files.
so below worked.
$ cat etc/kafka/connect-standalone.properties
key info extracted:
plugin.path=/user/share/java,/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2
# plugin.path=/usr/share/java,/home/min/confluent-5.5.1/share/java/kafka-connect-jdbc
before it was like below, and throw exception as I mentioned.
plugin.path=/user/share/java,/home/min/streamthoughts-kafka-connect-file-pulse-1.5.2/lib
# plugin.path=/usr/share/java,/home/min/confluent-5.5.1/share/java/kafka-connect-jdbc
maybe I did not understand below instructions in the connect-standalone.properties
property file, or maybe the file-pulse connectors did not follow the standards/instructions stated here. certainly, the JDBC connectors the path is the path containing the JAR files.
# a) directories immediately containing jars with plugins and their dependencies
# b) uber-jars with plugins and their dependencies
# c) directories immediately containing the package directory structure of classes of plugins and their dependencies
这篇关于kafka-connect 文件脉冲连接器独立无法启动(在偏移管理器上找不到类异常)的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!