Re: Improving performance of "select * from table" ?
Hi Gordon,
indeed, your experiments confirm that most of the time is spent in serializing, sending, and re-parsing your query's huge result.
I am wondering what kind of application needs to see the entire database table?
Surely no user can seriously handle millions of result tuples with tens (or more) of columns.
What is the purpose of using a DBMS if all it needs to do is "select * from table"?
In my/our opinion, as much of the application logic as possible should be expressed in SQL (plus domain- or application-specific extensions, if required) and thus performed in the DBMS, so that only small results need to be sent back to the client/application. Then the efficiency and performance of the DBMS (server) itself dominates and matters more than the server-client communication.
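To illustrate the point with a minimal, purely hypothetical sketch (the table and column names are invented, not from this thread): the aggregation runs inside the DBMS, so only one small summary row per group crosses the wire instead of millions of raw tuples.

-- Hypothetical schema: measurements(sample_id, gene, expression).
-- The server does the heavy lifting; the client receives only
-- one summary row per gene instead of every raw tuple.
SELECT gene,
       COUNT(*)        AS n_samples,
       AVG(expression) AS avg_expression
  FROM measurements
 GROUP BY gene;

Compared with "SELECT * FROM measurements", the result shrinks from one row per stored tuple to one row per distinct gene, which keeps serialization and transfer costs negligible.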
Best,
Stefan
Assaf Gordon
Stefan de Konink wrote, On 10/16/2012 06:49 PM:
[...] What about the time taken to serialise the results? For MonetDB this is done in a very 'plain text' wire format. [...] Are you aware of the PostgreSQL \timing command? That way you can see output similar to that of MonetDB with respect to the time taken for a query.
I re-ran one test using psql's and mclient's internal timing:

===
## Postgres: Select all columns from table
$ psql test
test=# \o /dev/null
test=# \timing
Timing is on.
test=# select * from bc251;
Time: 5506.768 ms
test=# select * from bc251;
Time: 5446.689 ms
test=#

## MonetDB: Select all columns from table
$ mclient -d voc
sql>\>/dev/null
sql>\w-1
sql>select * from bc251;
1657032 tuples (3.8s)
sql>select * from bc251;
1657032 tuples (3.8s)
sql>
===

As you said, the reported durations are much shorter than in my previous timings (MonetDB: 3.8s vs 2m16s; Postgres: 5.5s vs 21s).

However, those numbers don't reveal the entire story: mclient reports 3.8 seconds, but it took more than 2 minutes (of wall time) for the result to finish printing. I understand that this quite possibly indicates a serialization/transfer bottleneck, and it may be of lesser interest to the developers, but as an application developer I worry about the end-to-end timing of the query - the time it takes for my application to get the data back.

If there are any recommendations for improving those queries, they would be much appreciated.

Thanks,
 -gordon
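One way to capture the end-to-end cost described above is to time the whole client invocation from the shell rather than relying on the server-reported duration. A minimal sketch, assuming mclient's -s and psql's -c options for passing a single statement (database and table names as in the transcript above):

===
# Wall-clock time for the full round trip: query execution,
# serialization, transfer, and client-side rendering (discarded).
$ time mclient -d voc -s "select * from bc251" > /dev/null
$ time psql test -c "select * from bc251" > /dev/null
===

The gap between this wall-clock time and the server-reported duration is roughly the serialization/transfer/rendering overhead under discussion.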
Hello Stefan,

Stefan Manegold wrote, On 10/17/2012 04:05 AM:
indeed, your experiments confirm that most of the time is spent in serializing, sending, and re-parsing your query's huge result.
I am wondering what kind of application needs to see the entire database table?
I'm working on a website that will provide some analysis on genomic data. Most of the time, users are indeed interested in grouping/counting/summing and similar aggregation of the data - which MonetDB does exceptionally well. However, every now and then, users will want to export the data (either the entire table, or a large chunk of the raw data) and carry on the analysis on a different platform.
Surely no user can seriously handle millions of result tuples with tens (or more) of columns. What is the purpose of using a DBMS if all it needs to do is "select * from table"?
I don't expect them to open the data in Excel, of course. Users can export it to other web-based platforms (e.g. Galaxy[1], GenePattern[2], GenomeSpace[3]), or save it to a file and run R scripts on the data - those are designed to handle huge data files without a problem.

[1] Galaxy - http://galaxy.psu.edu/
[2] GenePattern - http://www.broadinstitute.org/cancer/software/genepattern/
[3] GenomeSpace - http://www.genomespace.org/
In my/our opinion, as much of the application logic as possible should be expressed in SQL (plus domain- or application-specific extensions, if required) and thus performed in the DBMS, so that only small results need to be sent back to the client/application. Then the efficiency and performance of the DBMS (server) itself dominates and matters more than the server-client communication.
Generally I agree. But the reality is that my web site will provide a specific (limited) set of functions over the data, and to continue with downstream analysis the users will need to be able to take the data elsewhere - I want to make that as easy as possible for them.

I'll investigate other ways to extract the data.

Thanks,
 -gordon
Hi Gordon,

I see - thanks for the explanations!

Well, I assume that for the export to some other system you still need a serialized textual form of the data, right? Hence, the serialization costs themselves cannot be avoided.

In case the MAPI protocol and client-side data parsing and rendering are indeed too much of an overhead even for "occasional" exports, you might want to consider the bulk-export functionality described under http://www.monetdb.org/Documentation/Manuals/SQLreference/CopyInto i.e.,

COPY subquery INTO file_name
    [ [USING] DELIMITERS field_separator [',' record_separator [',' string_quote]] ]
    [ NULL AS null_string ]

This will generate a (possibly even compressed) "DSV" (delimiter-separated values) file - obviously in the file system of your server. But if you wrap this functionality behind an "export" button on your website (rather than letting users type SQL literally), you can easily place the file somewhere the user can download it via HTTP, FTP, or the like.

Hope this helps,
Stefan
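As a concrete instance of the syntax above - a sketch only, with an assumed output path and delimiter choices, using the table name from this thread:

-- Export the table as comma-separated values; NULLs become empty strings.
COPY SELECT * FROM bc251 INTO '/tmp/bc251.csv'
     USING DELIMITERS ',', '\n', '"'
     NULL AS '';

-- Per the "possibly even compressed" remark above, a compressed variant
-- (assumption: the format is inferred from the .gz suffix).
COPY SELECT * FROM bc251 INTO '/tmp/bc251.csv.gz'
     USING DELIMITERS ',', '\n', '"'
     NULL AS '';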
Hello Stefan,

Stefan Manegold wrote, On 10/17/2012 12:07 PM:
In case the MAPI protocol and client-side data parsing and rendering are indeed too much of an overhead even for "occasional" exports, you might want to consider the bulk-export functionality described under http://www.monetdb.org/Documentation/Manuals/SQLreference/CopyInto i.e.,

COPY subquery INTO file_name
    [ [USING] DELIMITERS field_separator [',' record_separator [',' string_quote]] ]
    [ NULL AS null_string ]

This will generate a (possibly even compressed) "DSV" (delimiter-separated values) file - obviously in the file system of your server. But if you wrap this functionality behind an "export" button on your website (rather than letting users type SQL literally), you can easily place the file somewhere the user can download it via HTTP, FTP, or the like.
Thanks for the tip.

I tried the "copy" command, and unfortunately it is not a lot faster:

===
sql>copy select * from bc251 into '/tmp/foo2' ;
1657032 affected rows (2m 14s)
===

Sadly, I also found a bug related to "COPY INTO": http://bugs.monetdb.org/show_bug.cgi?id=3165

So I'm still exploring other possibilities.

Thanks,
 -gordon
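One further possibility worth sketching here, under the assumption that the installed mclient supports the -f output-format option with a csv renderer: stream the result in a machine-readable format directly to a file, bypassing the interactive pretty-printer that tends to dominate client-side cost.

===
# Render the result as CSV straight into a client-side file,
# skipping the aligned-table renderer (assumes mclient -f csv exists
# in this MonetDB release; output path is hypothetical).
$ mclient -d voc -f csv -s "select * from bc251" > /tmp/bc251.csv
===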
On Wed, Oct 17, 2012 at 11:34:38AM -0400, Assaf Gordon wrote:
[...]

I'll investigate other ways to extract the data.

Did you look at the COPY INTO file options?
Niels
--
Niels Nes, Centrum Wiskunde & Informatica (CWI)
Science Park 123, 1098 XG Amsterdam, The Netherlands
room L3.14, phone ++31 20 592-4098, sip:4098@sip.cwi.nl
url: http://www.cwi.nl/~niels  e-mail: Niels.Nes@cwi.nl
Participants (3): Assaf Gordon, Niels Nes, Stefan Manegold