Details

Type: Bug
Resolution: Fixed
Priority: Critical
Fix Version/s: 4.5.1, 5.0.0
Affects Version/s: master
Component/s: forestdb
Labels:
None
Environment:
iOS 64-bit

Triage:
Untriaged
Operating System:
MacOSX 64-bit
Is this a Regression?:
Yes
Sprint:
ForestDB: Oct 17 - Nov 4

Description

My unofficial Couchbase Lite performance test shows our iOS 1.3 candidate running at only 60% of the speed of 1.2.1 when using ForestDB storage. (On my iPhone 6 the test time went from 16 sec to 27 sec.) SQLite performance is unaffected (still 21 sec). This means that ForestDB storage is now slower than SQLite.

I ran the test in Apple’s Instruments tool and found that 30% of the total run time of the test is being spent in `malloc`, `free` and `memcpy` calls made by a handful of ForestDB functions:

10.3% `free` calls in _hbtrie_find
5.7% `malloc` calls in _hbtrie_find
4.6% `memcpy` calls in _docio_read_doc_component
3.8% `free` calls in _hbtrie_insert
2.9% `memcpy` calls in _hbtrie_reform_key
2.5% `malloc` calls in _hbtrie_insert
____
29.8%

memcpy has always been the single hottest function when running this benchmark, but the malloc/free overhead is new. It seems to come from these lines found in both _hbtrie_find and _hbtrie_insert:

uint8_t *docrawkey = (uint8_t *) malloc(HBTRIE_MAX_KEYLEN);
uint8_t *dockey = (uint8_t *) malloc(HBTRIE_MAX_KEYLEN);
...
free(docrawkey);
free(dockey);

The value of HBTRIE_MAX_KEYLEN is 65536. It appears that on iOS (and macOS?) requests this large go through a slower code path in malloc/free, which uses Mach system calls (mach_vm_map, mach_vm_deallocate) to map and unmap the address space directly from the VM system. Regardless, it would be good to avoid any memory allocation in a hot code path like this.
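
One way to check the suspicion about large requests is a small timing loop. The sketch below is not part of ForestDB; `time_alloc_pairs` is a hypothetical helper that times repeated malloc/free pairs of a given size, so that a 64 KB request can be compared against a small one on the device in question. It assumes a POSIX `clock_gettime` is available.

```c
#define _POSIX_C_SOURCE 199309L  /* for clock_gettime on strict C builds */
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <time.h>

/* Hypothetical helper (not a ForestDB function): run `iters` malloc/free
 * pairs of `size` bytes and return the elapsed wall-clock time in seconds. */
double time_alloc_pairs(size_t size, int iters) {
    assert(size > 0 && iters > 0);
    struct timespec t0, t1;
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < iters; i++) {
        volatile uint8_t *p = malloc(size);
        if (!p) abort();
        p[0] = (uint8_t)i;  /* touch the block so the pair is not optimized away */
        free((void *)p);
    }
    clock_gettime(CLOCK_MONOTONIC, &t1);
    return (double)(t1.tv_sec - t0.tv_sec)
         + (double)(t1.tv_nsec - t0.tv_nsec) / 1e9;
}
```

Calling `time_alloc_pairs(65536, n)` and `time_alloc_pairs(64, n)` with the same `n` and dividing each result by `n` gives a rough per-pair cost for each size. The numbers depend entirely on the platform's allocator, so none are quoted here.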

Would it be possible to use a per-handle or per-thread buffer instead? Or at least to allocate only as much memory as needed for the key being read?
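
As a rough illustration of the per-thread option, here is a minimal sketch, assuming a C11 compiler with `_Thread_local` support. The names `key_scratch`, `tls_scratch` and `scratch_prepare_key` are hypothetical and do not exist in ForestDB; only `HBTRIE_MAX_KEYLEN`, `docrawkey` and `dockey` come from the snippet above. It is not a description of the change that was eventually merged.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define HBTRIE_MAX_KEYLEN 65536  /* value quoted in this report */

/* Hypothetical per-thread scratch space: both key buffers are reserved
 * once per thread instead of being malloc'd and freed on every call. */
typedef struct {
    uint8_t docrawkey[HBTRIE_MAX_KEYLEN];
    uint8_t dockey[HBTRIE_MAX_KEYLEN];
} key_scratch;

static _Thread_local key_scratch tls_scratch;

/* Hypothetical stand-in for the key handling inside a find/insert call:
 * stages the key in the thread's scratch buffers and returns a pointer
 * to the staged copy. No heap allocation happens on this path. */
const uint8_t *scratch_prepare_key(const void *key, size_t keylen) {
    assert(key != NULL && keylen <= HBTRIE_MAX_KEYLEN);
    key_scratch *s = &tls_scratch;
    memcpy(s->docrawkey, key, keylen);         /* raw key as read */
    memcpy(s->dockey, s->docrawkey, keylen);   /* working copy */
    return s->dockey;
}
```

The trade-off is 128 KB of thread-local storage per thread that touches these functions, and the buffers must not be used re-entrantly on the same thread. Sizing a buffer to the actual key length, as the second question suggests, would avoid that cost but keeps an allocation in the hot path.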

Issue Links

blocks: MB-19612 4.5.1 Minor Release (Closed)
relates to: MB-20231 Implement a global memory pool to be used by hbtrie's find and insert operations (Closed)

Gerrit Reviews

For Gerrit Dashboard: MB-20219
#	Subject	Branch	Project	Status	Code-Review	Verified
66027,3	MB-20219: Stick to heap allocation just for windows	master	forestdb	MERGED	+2	+1
66029,2	[BP] MB-20219: Stick to heap allocation just for windows	watson	forestdb	MERGED	+2	+1
66030,2	Merge remote-tracking branch 'couchbase/watson' into stable	stable	forestdb	MERGED	+2	+1

People

Assignee: Jens Alfke
Reporter: Jens Alfke
Votes: 0
Watchers: 5

Dates

Created: 20/Jul/16 4:09 PM
Updated: 14/Jan/17 2:02 AM
Resolved: 20/Jul/16 8:53 PM

60% slowdown of CBL benchmark on iOS, due to ForestDB malloc/free calls
