Monetdb clusters VS Distributed Query Processing

22 Sep 2015

      Hi,

I am trying to understand the main difference between clusters and the
remote table introduced in JUL2015 .

Correct me if I am wrong, in clusters the complete table is copied over all
nodes, and monetdb will do the query parallel execution on the nodes and
result concat. In the remote table I have more control on executing the
query on a part of a table on a selected node depending on resources
availables.

If this is true, are the following point valid:

   1. its better to have identical nodes for clusters
   2. the execution time is inversely proportional to the number of nodes
   in the clusters
   3. the executing time is dictated by the slowest node in cluster
   4. fail-over is by default supported in cluster configuration

One last question about the supported queries in these 2 modes(groupby,
order, limit).

if any, a reference to a documentation about the clustering is appreciated.

Thank you.

imad hajj chahine

Roberto Spadim

tags

participants (2)