29 May
2009
29 May
'09
10:35 a.m.
How many tuples are in your 60GB dataset, i.e., your table "raw_data"? Of which datatype is attribute "keyword"? How many distinct values are there / do you expect in column "keyword"?
This depends on the amount and kind of data as well as the type of queries. While we manage to run 100GB TPC-H on a 8 GB machine, generally "the more the better" does apply...
There are aprox 28 million keywords (varchar(100)) out of 680 million row table. Yes I know 3GB is not ideal and I wasn't expecting blindingly fast queries until we bump up the memory, but I wasn't expecting the memory mapping issue. There's plenty of space on the disk - 151GB;