Sure, I will see what I can do.
Best wishes,
Joshua Shuai Yuan
On Wed, Sep 12, 2012 at 11:40 PM, Wang, Fusheng <fusheng.wang@emory.edu> wrote:
Hi Joshua,
The table we are managing is on the scale of 30K x 5K, or 150M cells. If each table cell (double type) ideally takes 8 bytes, the space needed will be a couple of GB (about 1.2 GB of raw data), or at most on the scale of tens of GB. If that is the case, a distributed setup may not be needed, as the data can mostly be cached by the database. MonetDB does support multiple cores and multiple disks, but to my knowledge a setup across multiple machines is not supported.
Even though MonetDB claims an unlimited number of columns, we should still be cautious about the performance we can achieve for the queries we want to provide. A pilot study could give us some guidelines. Do you think you could set up MonetDB and create a benchmark table so we can do a performance study?
Interestingly, the issue of very large column counts is also discussed on a well-known database blog:
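As a quick check of the size estimate above, here is a minimal sketch of the arithmetic. It counts raw 8-byte doubles only; any storage overhead inside the database is not modeled, and the variable names are just illustrative.

```python
# Back-of-the-envelope size of the matrix, assuming one 8-byte double per cell.
N_COLUMNS = 30_000        # wide dimension (the "30K" above)
N_ROWS = 5_000            # short dimension (the "5K" above)
BYTES_PER_DOUBLE = 8

cells = N_COLUMNS * N_ROWS
raw_bytes = cells * BYTES_PER_DOUBLE
raw_gb = raw_bytes / 1e9  # decimal gigabytes

print(f"{cells:,} cells, {raw_gb:.1f} GB of raw doubles")
```

This works out to 150,000,000 cells and 1.2 GB, which fits in memory on a single machine.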
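One possible starting point for the benchmark table is to generate the wide CREATE TABLE statement with a script rather than by hand. The sketch below only builds the SQL text; the table name, the `row_id` key, and the `c0`, `c1`, ... column names are hypothetical placeholders, and whether MonetDB accepts and performs well with this many columns is exactly what the pilot study would have to show.

```python
def make_wide_table_ddl(table_name: str, n_columns: int) -> str:
    """Build a CREATE TABLE statement with n_columns DOUBLE columns.

    Column names (c0, c1, ...) and the row_id key are illustrative only.
    """
    columns = ",\n".join(f"    c{i} DOUBLE" for i in range(n_columns))
    return (
        f"CREATE TABLE {table_name} (\n"
        f"    row_id INTEGER,\n"
        f"{columns}\n"
        f");"
    )


# Small example to inspect the shape of the statement.
print(make_wide_table_ddl("bench_small", 3))

# Full-width statement for the 30K-column case; write it to a file or
# feed it to a SQL client of your choice.
wide_ddl = make_wide_table_ddl("bench_wide", 30_000)
```

Starting with a smaller column count and scaling up would show where load and query times begin to degrade, if they do at all.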
http://www.dbms2.com/2011/03/13/so-how-many-columns-can-a-single-table-have-anyway/
One commenter wrote:
“Genomics models were the primary driver. these folks typically have short but wide datasets of the order of 300,000 columns.”
So we are not alone. And we have only a fraction of that many columns (30K versus 300K), so we are lucky!
Fusheng
From: Joshua Shuai Yuan [mailto:shuaiyuan.emory@gmail.com]
Sent: Wednesday, September 12, 2012 11:14 PM
To: Wang, Fusheng
Cc: Qin, Zhaohui
Subject: Re: Array database for large matrix
Hi Dr. Wang,
That's really good news. Does it support a distributed database? Or do we need a distributed one?
Best wishes,
Joshua Shuai Yuan
On Wed, Sep 12, 2012 at 3:54 PM, Wang, Fusheng <fusheng.wang@emory.edu> wrote:
Hi guys,
It looks like the matrix structure can be nicely supported by array databases like MonetDB. It supports an unlimited number of columns per table. It’s also open source.
http://www.monetdb.org/Home/Features
I know the group quite well, and I will chat with them about the use case to see if it’s a good fit.
Thanks,
Fusheng