During some testing with the Hadoop connector, we found that the TAP backfill remaining stat would show, correctly, the number of items during the memory portion of the backfill and then would start to vary between 0 and 1 per node. The theory is that it's showing backfill per vbucket (the disk backfill) as number of backfills running after the memory backfill completes.
The full name of the stat is ep_tap_user_queue_backfillremaining.
This can be rather confusing, as users will see a huge spike in items being backfilled followed by, what appears to be, not much of anything.
The backfill runs correctly.
Recommended solution is to convert the backfill remaining stat to either exact or approximate number of items remaining and make it work correctly for both memory and vbucket stats.
|For Gerrit Dashboard: &For+MB-4813=message:MB-4813|
|13344,2||MB-4813 Maintain TAP backfill queue and backlog size separately.||ep-engine||Status: MERGED||+2||+1|