On 18-12-2009 20:01:40 +0100, Bojan Šmid wrote:
I am thinking of using MonetDB as a data storage for my application. The application will create lots of data (few GB of new data each day) while there will not be many queries (probably few thousands/day). Queries will be complex, but they should execute really fast. Since data will grow pretty fast, I must prepare for scaling, so I need to know is there a way to distribute MonetDB over multiple machines?
At the moment there is no off-the-shelf implementation of any distributed/orchestrated version of MonetDB. That means, MonetDB itself doesn't manage anything like that currently, though it is not impossible to create distributed solutions, think of replicas or sharding (fragmentation). What /is/ around, is Merovingian, which can help you to implement any distributed solution. Merovingian can make access to MonetDB servers running on different machines transparent from any machine, allowing you you to use patterns to find MonetDB instances back in your cluster. See also the REMOTE DATABASES section in the merovingian man-page: http://homepages.cwi.nl/~fabian/MonetDB/Man%20Pages.html/merovingian.html