To optimize insert speed, combine many small operations into a single large operation. Ideally, you make a single connection, send the data for many new rows at once, and delay all index updates and consistency checking until the very end.
Actually, SQLite will easily do 50,000 or more INSERT statements per second on an average desktop computer. But it will only do a few dozen transactions per second. Transaction speed is limited by the rotational speed of your disk drive.
SQLite doesn't have any special way to bulk insert data. To get optimal performance when inserting or updating data, ensure that you do the following:
- Use a transaction.
- Reuse the same parameterized command (as sketched below).
The SQLite docs explain why this is so slow: Transaction speed is limited by disk drive speed because (by default) SQLite actually waits until the data really is safely stored on the disk surface before the transaction is complete. That way, if you suddenly lose power or if your OS crashes, your data is still safe.
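A minimal sketch of that pattern with the SQLite C API follows; the table t, its name column and the bulk_insert helper are made-up placeholders, and error checking is omitted for brevity.

#include <sqlite3.h>

/* One transaction, one prepared statement reused for every row. */
void bulk_insert(sqlite3 *db, const char **names, int n)
{
    sqlite3_stmt *stmt;

    sqlite3_exec(db, "BEGIN TRANSACTION", NULL, NULL, NULL);
    sqlite3_prepare_v2(db, "INSERT INTO t (name) VALUES (?)", -1, &stmt, NULL);

    for (int i = 0; i < n; i++) {
        sqlite3_bind_text(stmt, 1, names[i], -1, SQLITE_TRANSIENT); /* copy the value in */
        sqlite3_step(stmt);   /* run the insert */
        sqlite3_reset(stmt);  /* make the statement reusable for the next row */
    }

    sqlite3_finalize(stmt);
    sqlite3_exec(db, "COMMIT", NULL, NULL, NULL);
}

Because everything happens inside one transaction, the disk is only synced once at COMMIT rather than once per row.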
Several tips:

- Consider a less paranoid journal mode (pragma journal_mode). There is NORMAL, and then there is OFF, which can significantly increase insert speed if you're not too worried about the database possibly getting corrupted if the OS crashes. If your application crashes, the data should be fine. Note that in newer versions, the OFF/MEMORY settings are not safe for application-level crashes.
- Playing with page sizes makes a difference as well (PRAGMA page_size). Having larger page sizes can make reads and writes go a bit faster, as larger pages are held in memory. Note that more memory will be used for your database.
- If you have indices, consider calling CREATE INDEX after doing all your inserts. This is significantly faster than creating the index and then doing your inserts (see the sketch after this list).
- Use INTEGER PRIMARY KEY if possible, which will replace the implied unique row number column in the table.
- Don't use !feof(file)!

I've also asked similar questions here and here.
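A rough sketch of the pragma and deferred-index tips, again with the C API; the pragma values, table and index names are only examples, and journal_mode = OFF trades crash-safety for speed as described above.

#include <sqlite3.h>

void tune_for_bulk_load(sqlite3 *db)
{
    /* No rollback journal: fast, but the database can be corrupted if the OS crashes. */
    sqlite3_exec(db, "PRAGMA journal_mode = OFF;", NULL, NULL, NULL);
    /* Larger pages; only takes effect before the database has content (or after VACUUM). */
    sqlite3_exec(db, "PRAGMA page_size = 8192;", NULL, NULL, NULL);
}

void index_after_load(sqlite3 *db)
{
    /* Build the index once, after all inserts, instead of maintaining it on every row. */
    sqlite3_exec(db, "CREATE INDEX idx_t_name ON t(name);", NULL, NULL, NULL);
}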
Try using SQLITE_STATIC instead of SQLITE_TRANSIENT for those inserts. SQLITE_TRANSIENT will cause SQLite to copy the string data before returning. SQLITE_STATIC tells it that the memory address you gave it will be valid until the query has been performed (which in this loop is always the case). This will save you several allocate, copy and deallocate operations per loop. Possibly a large improvement.
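A hedged sketch of the two destructor arguments; stmt and buffer are assumed to come from the surrounding insert loop, and buffer must stay valid and unmodified until sqlite3_step() has run when SQLITE_STATIC is used.

#include <sqlite3.h>

static void bind_name(sqlite3_stmt *stmt, const char *buffer)
{
    /* SQLITE_TRANSIENT: SQLite copies the string before returning. */
    /* sqlite3_bind_text(stmt, 1, buffer, -1, SQLITE_TRANSIENT); */

    /* SQLITE_STATIC: no copy; SQLite reads from the caller's buffer directly. */
    sqlite3_bind_text(stmt, 1, buffer, -1, SQLITE_STATIC);
}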
Avoid sqlite3_clear_bindings(stmt). The code in the test sets the bindings every time through, which should be enough.

The C API intro from the SQLite docs says:

Prior to calling sqlite3_step() for the first time or immediately after sqlite3_reset(), the application can invoke the sqlite3_bind() interfaces to attach values to the parameters. Each call to sqlite3_bind() overrides prior bindings on the same parameter.

There is nothing in the docs for sqlite3_clear_bindings saying you must call it in addition to simply setting the bindings.
More detail: Avoid_sqlite3_clear_bindings()
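For illustration, a minimal rebind/step/reset cycle without sqlite3_clear_bindings(); the statement is assumed to have a single text parameter, as in the test code discussed above.

#include <sqlite3.h>

static void insert_rows(sqlite3_stmt *stmt, const char **rows, int n)
{
    for (int i = 0; i < n; i++) {
        /* Rebinding overrides the previous value for parameter 1 ... */
        sqlite3_bind_text(stmt, 1, rows[i], -1, SQLITE_STATIC);
        sqlite3_step(stmt);
        sqlite3_reset(stmt);
        /* ... so sqlite3_clear_bindings(stmt) is not needed here. */
    }
}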
Inspired by this post and by the Stack Overflow question that led me here -- Is it possible to insert multiple rows at a time in an SQLite database? -- I've posted my first Git repository:
https://github.com/rdpoor/CreateOrUpdate
which bulk loads an array of ActiveRecords into MySQL, SQLite or PostgreSQL databases. It includes an option to ignore existing records, overwrite them or raise an error. My rudimentary benchmarks show a 10x speed improvement compared to sequential writes -- YMMV.
I'm using it in production code where I frequently need to import large datasets, and I'm pretty happy with it.
Bulk imports seem to perform best if you can chunk your INSERT/UPDATE statements. A value of 10,000 or so has worked well for me on a table with only a few rows, YMMV...
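A sketch of chunked commits along those lines; the 10,000 figure is just the value mentioned above, and the helper name and single-parameter statement are placeholders.

#include <sqlite3.h>

void chunked_insert(sqlite3 *db, sqlite3_stmt *stmt,
                    const char **rows, int n, int chunk)
{
    sqlite3_exec(db, "BEGIN", NULL, NULL, NULL);
    for (int i = 0; i < n; i++) {
        sqlite3_bind_text(stmt, 1, rows[i], -1, SQLITE_STATIC);
        sqlite3_step(stmt);
        sqlite3_reset(stmt);

        if ((i + 1) % chunk == 0) {                          /* finish this batch... */
            sqlite3_exec(db, "COMMIT", NULL, NULL, NULL);
            sqlite3_exec(db, "BEGIN", NULL, NULL, NULL);     /* ...and start the next */
        }
    }
    sqlite3_exec(db, "COMMIT", NULL, NULL, NULL);
}

Called as chunked_insert(db, stmt, rows, n, 10000), this commits every ten thousand rows, so a crash loses at most one batch instead of the whole load.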
If you care only about reading, a somewhat faster (but potentially stale-data) approach is to read from multiple connections in multiple threads (one connection per thread).
First, find the total number of items in the table:
SELECT COUNT(*) FROM table
then read in pages (LIMIT/OFFSET):
SELECT * FROM table ORDER BY _ROWID_ LIMIT <limit> OFFSET <offset>
where <limit> and <offset> are calculated per-thread, like this:
int limit = (count + n_threads - 1) / n_threads;   /* rows per thread, rounded up */
for (int thread_index = 0; thread_index < n_threads; thread_index++) {
    int offset = thread_index * limit;
    /* each thread reads the page [offset, offset + limit) */
}
For our small (200 MB) database this gave a 50-75% speed-up (SQLite 3.8.0.2, 64-bit, on Windows 7). Our tables are heavily non-normalized (1,000-1,500 columns, roughly 100,000 or more rows).
Too many or too few threads won't help; you need to benchmark and profile it yourself.
Also, for us, SHAREDCACHE made performance slower, so I manually put PRIVATECACHE (because it was enabled globally for us).
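For reference, a sketch of the connection-per-thread paged read described above, using pthreads; the table name t, the 16-thread cap and the missing error handling are simplifications for illustration, not the poster's actual code.

#include <pthread.h>
#include <sqlite3.h>

typedef struct {
    const char *db_path;
    int limit;    /* rows per thread */
    int offset;   /* first row for this thread */
} page_job;

static void *read_page(void *arg)
{
    page_job *job = (page_job *)arg;
    sqlite3 *db;
    sqlite3_stmt *stmt;

    /* Each thread gets its own read-only connection with a private page cache. */
    sqlite3_open_v2(job->db_path, &db,
                    SQLITE_OPEN_READONLY | SQLITE_OPEN_PRIVATECACHE, NULL);
    sqlite3_prepare_v2(db,
        "SELECT * FROM t ORDER BY _ROWID_ LIMIT ? OFFSET ?", -1, &stmt, NULL);
    sqlite3_bind_int(stmt, 1, job->limit);
    sqlite3_bind_int(stmt, 2, job->offset);

    while (sqlite3_step(stmt) == SQLITE_ROW) {
        /* process one row */
    }

    sqlite3_finalize(stmt);
    sqlite3_close(db);
    return NULL;
}

void read_in_parallel(const char *db_path, int count, int n_threads)
{
    pthread_t tids[16];                                /* assumes n_threads <= 16 */
    page_job jobs[16];
    int limit = (count + n_threads - 1) / n_threads;   /* rows per thread, rounded up */

    for (int i = 0; i < n_threads; i++) {
        jobs[i] = (page_job){ db_path, limit, i * limit };
        pthread_create(&tids[i], NULL, read_page, &jobs[i]);
    }
    for (int i = 0; i < n_threads; i++)
        pthread_join(tids[i], NULL);
}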