Hi,

I ran into a similar problem maybe two weeks ago and was tactfully reminded to check that the DB is in maintenance mode. Otherwise, the WAL gets too long to read back (that's what crashed MonetDB for me). In my case there was no need for any additional parameters and I could load 1B rows into each of ten tables without any problems; the lock/release commands I used are sketched below the quoted mail. Admittedly, our machine has a lot of HDD and RAM.

Ralph

On 09/13/2013 05:38 PM, Thomas Johann wrote:
Hi everybody,

I have a problem inserting a large amount of data into a MonetDB database using bulk import.
I'm running the following commands in a loop:
connection.execute("COPY *19000000* OFFSET 2 RECORDS INTO XXX FROM '" + csv + "' USING DELIMITERS ';','\n' ;")
connection.commit()
Here csv is a different CSV file in each iteration of the loop; every file contains 18000001 rows of data.
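To make this concrete, the whole loop looks roughly like the following. I'm sketching it with the python-monetdb client; the connection parameters, file paths and the table name XXX are placeholders for my real setup:

import glob
import monetdb.sql  # python-monetdb client (an assumption; this mirrors the calls in my snippet above)

# placeholder connection parameters
connection = monetdb.sql.connect(username="monetdb", password="monetdb",
                                 hostname="localhost", database="mydb")

for csv in sorted(glob.glob("/data/exports/*.csv")):  # absolute paths; COPY reads the files server-side
    # each file: 1 header line (skipped via OFFSET 2) plus 18000000 data records;
    # 19000000 is just a generous upper bound for the record count
    connection.execute("COPY 19000000 OFFSET 2 RECORDS INTO XXX FROM '"
                       + csv + "' USING DELIMITERS ';','\n' ;")
    connection.commit()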
To be sure that enough memory is allocated, I chose 19000000 in the execute command. I now have two questions:

1. Should the number of records (here 19000000) be the number of lines per CSV file or the number of lines of the final table (number of CSV files * 18 million)?

2. Can you think of any reason why MonetDB would stop reading one specific variable while continuing to read the others? Say my CSV has 8 columns and 18000000 rows with no missing values in the raw data. Up to row 16537472 all data is read in, but from there until line 18000000 variable 3 is missing, while variables 1-2 and 4-8 are perfectly fine. Can this be due to memory or hard-disk speed constraints? And why is no error message raised?
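For what it's worth, the gap shows up with a simple check like the following, run after the loads (table and column names are again placeholders for my real schema):

SELECT COUNT(*) FROM XXX WHERE var3 IS NULL;
-- comes back non-zero, covering roughly the rows from 16537472 onwards in the affected file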
It would be great if someone could help me.

Thanks,
Thomas
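PS: The maintenance mode I mean is the one you toggle with the monetdb control program of monetdbd. From memory (so please double-check against your monetdbd documentation), it is roughly:

monetdb lock mydb       # put the database into maintenance mode before the bulk loads
monetdb release mydb    # bring it back online afterwards

where mydb is of course your database name. If I remember correctly, a locked database only accepts connections from the administrator account, so the COPY INTO session has to be opened accordingly.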
--
Ralph Holz
I8 - Network Architectures and Services
Technische Universität München
http://www.net.in.tum.de/de/mitarbeiter/holz/
Phone +49.89.289.18043
PGP: A805 D19C E23E 6BBB E0C4 86DC 520E 0C83 69B0 03EF