Details
- Bug
- Resolution: Fixed
- Critical
- Columnar 1.0.0
- Columnar Edition 1.0.0 build 2190, 4-node cluster (16 vCPUs + 64 GB)
- Untriaged
- 0
- Unknown
Description
The workload is as follows:
7 links (3 remote + 2 external + 2 Kafka)
50 million items per remote collection * 80 collections
~8 million items per standalone collection * 300 collections
10 million items per Kafka collection * 40 collections
However, this was seen during a local test run. The test framework retries an HTTP request if no response is received within the default timeout period. I ran a create Kafka collection request and, since no response was received, the framework sent a second request and then a third (3 in total). I have since fixed the test to run with a timeout of 10 minutes. The affected collection was:
`Database0QuzaRSxN`.`scope1kNrdWyfg`.`LinkedDatasetnizURFRokx`
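The retry behaviour described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: `RetryOnTimeoutSketch`, `sendWithRetries`, and the simulated server are hypothetical names that do not come from the test framework or from Columnar. It shows that when the client-side timeout is shorter than the time the server needs, each retry reaches the server as a duplicate request while the earlier ones are still in progress.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical simulation; none of these names come from the test framework.
final class RetryOnTimeoutSketch {

    // Returns how many requests the simulated server received.
    static int sendWithRetries(int maxAttempts, long timeoutMillis, long serverWorkMillis) {
        AtomicInteger received = new AtomicInteger();
        ExecutorService server = Executors.newCachedThreadPool();
        // Stand-in for a slow statement such as creating a Kafka collection.
        Callable<String> createCollection = () -> {
            received.incrementAndGet();
            Thread.sleep(serverWorkMillis);
            return "created";
        };
        try {
            for (int attempt = 1; attempt <= maxAttempts; attempt++) {
                Future<String> response = server.submit(createCollection);
                try {
                    response.get(timeoutMillis, TimeUnit.MILLISECONDS);
                    break; // response arrived in time, no retry needed
                } catch (TimeoutException e) {
                    // The client stops waiting, but the server-side work keeps
                    // running, so the next attempt is a duplicate request.
                }
            }
            // Let in-flight requests finish before reading the counter.
            server.shutdown();
            server.awaitTermination(30, TimeUnit.SECONDS);
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException(e);
        } finally {
            server.shutdownNow();
        }
        return received.get();
    }
}
```

With a 50 ms client timeout against 500 ms of simulated server work, `sendWithRetries(3, 50, 500)` reports three received requests, which mirrors the three create requests in this run. Raising the timeout above the server's response time, as the test fix does, leaves a single request.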
At some point it ran into a NullPointerException.
Exceptions seen ->
node 001
2024-07-03T07:29:07.645+00:00 FATA CBAS.active.ActiveManager [Executor-2089:448249404ea709b61341753fefde669e] Failed to stop runtime: (Default.linkWdJpJIxe.b-3-public.qekafkatestcluster.7b9vtv.c13.kafka.us-east-1.amazonaws.com:9196,b-2-public.qekafkatestcluster.7b9vtv.c13.kafka.us-east-1.amazonaws.com:9196,b-1-public.qekafkatestcluster.7b9vtv.c13.kafka.us-east-1.amazonaws.com:9196(CB))[31]:TO
java.lang.NullPointerException: Cannot invoke "java.util.concurrent.Future.cancel(boolean)" because "this.dataPollerFuture" is null
    at com.couchbase.analytics.adapter.TopicRecordReader.stop(TopicRecordReader.java:289) ~[columnar-connector.jar:1.0.0-2190]
    at org.apache.asterix.external.dataflow.FeedRecordDataFlowController.stop(FeedRecordDataFlowController.java:249) ~[asterix-external-data.jar:1.0.0-2190]
    at org.apache.asterix.external.dataset.adapter.FeedAdapter.stop(FeedAdapter.java:50) ~[asterix-external-data.jar:1.0.0-2190]
    at com.couchbase.analytics.runtime.TopicOperatorNodePushable.abort(TopicOperatorNodePushable.java:83) ~[columnar-connector.jar:1.0.0-2190]
    at org.apache.asterix.active.ActiveSourceOperatorNodePushable.stop(ActiveSourceOperatorNodePushable.java:68) ~[asterix-active.jar:1.0.0-2190]
    at org.apache.asterix.active.ActiveManager.stopIfRunning(ActiveManager.java:214) ~[asterix-active.jar:1.0.0-2190]
    at org.apache.asterix.active.ActiveManager.lambda$stopRuntime$3(ActiveManager.java:202) ~[asterix-active.jar:1.0.0-2190]
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
    at java.base/java.lang.Thread.run(Thread.java:840) [?:?]
2024-07-03T07:29:07.646+00:00 INFO CBAS.adapter.TopicRecordReader [Executor-2001:448249404ea709b61341753fefde669e] Error while polling data from Kafka
org.apache.kafka.common.errors.InterruptException: java.lang.InterruptedException
    at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.maybeThrowInterruptException(ConsumerNetworkClient.java:535) ~[kafka-clients-3.6.0.jar:?]
    at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:296) ~[kafka-clients-3.6.0.jar:?]
    at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:251) ~[kafka-clients-3.6.0.jar:?]
    at org.apache.kafka.clients.consumer.KafkaConsumer.pollForFetches(KafkaConsumer.java:1255) ~[kafka-clients-3.6.0.jar:?]
    at org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1186) ~[kafka-clients-3.6.0.jar:?]
    at org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1159) ~[kafka-clients-3.6.0.jar:?]
    at com.couchbase.analytics.adapter.TopicRecordReader.dataPoll(TopicRecordReader.java:365) [columnar-connector.jar:1.0.0-2190]
    at com.couchbase.analytics.adapter.TopicRecordReader$KafkaTopicDataPoller.run(TopicRecordReader.java:393) [columnar-connector.jar:1.0.0-2190]
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:539) [?:?]
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
    at java.base/java.lang.Thread.run(Thread.java:840) [?:?]
Caused by: java.lang.InterruptedException
    ... 13 more
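The first trace shows `stop()` calling `cancel(boolean)` on a `dataPollerFuture` that was never assigned, which can happen when a stop arrives before the poller has been started. The sketch below illustrates one common way to make such a stop path tolerant of that ordering. It is not the actual `TopicRecordReader` code or the fix that was applied: `GuardedPollingReader` and its methods are hypothetical, and only the field name `dataPollerFuture` is borrowed from the exception message.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for a reader that polls on a background thread.
final class GuardedPollingReader {
    private final ExecutorService executor = Executors.newSingleThreadExecutor();
    // Assigned only by start(); remains null if stop() is called first.
    private Future<?> dataPollerFuture;

    synchronized void start() {
        if (dataPollerFuture == null) {
            dataPollerFuture = executor.submit(() -> {
                // Simulated poll loop that exits once the thread is interrupted.
                while (!Thread.currentThread().isInterrupted()) {
                    try {
                        Thread.sleep(10);
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                }
            });
        }
    }

    // Returns true only if a running poller was actually cancelled.
    synchronized boolean stop() {
        boolean cancelled = false;
        if (dataPollerFuture != null) { // guard: the poller may never have started
            cancelled = dataPollerFuture.cancel(true);
            dataPollerFuture = null;
        }
        executor.shutdownNow();
        return cancelled;
    }
}
```

In this sketch, calling `stop()` on a reader that was never started returns `false` instead of throwing, and a repeated `stop()` is a no-op. Synchronizing `start()` and `stop()` is one design choice for keeping the null check and the assignment from interleaving; the real component may coordinate its lifecycle differently.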
cbcollect ->
https://cb-engineering.s3.amazonaws.com/SysTestColumnarJul3/collectinfo-2024-07-03T084841-ns_1%40svc-da-node-001.b2yoytucmykunsrf.sandbox.nonprod-project-avengers.com.zip
https://cb-engineering.s3.amazonaws.com/SysTestColumnarJul3/collectinfo-2024-07-03T084841-ns_1%40svc-da-node-002.b2yoytucmykunsrf.sandbox.nonprod-project-avengers.com.zip
https://cb-engineering.s3.amazonaws.com/SysTestColumnarJul3/collectinfo-2024-07-03T084841-ns_1%40svc-da-node-003.b2yoytucmykunsrf.sandbox.nonprod-project-avengers.com.zip
https://cb-engineering.s3.amazonaws.com/SysTestColumnarJul3/collectinfo-2024-07-03T084841-ns_1%40svc-da-node-004.b2yoytucmykunsrf.sandbox.nonprod-project-avengers.com.zip
Supportal ->