...
Gareth Williams, SQL Queries on Data Streams
KSQL differs from MySQL specifically, but no more than other SQL variants differ from one another.
Cam noted the need for caution in creating a MySQL cache.
Cam asked whether users would be creating queries to run on the stream, and whether there is a limit on how many such queries we can support.
Cam suggested favouring KSQL unless it proves unsuitable, especially if Kafka is the Data Bus.
Dave M believes the web interface approach would allow people to use either, without extra work.
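As a rough illustration of the kind of user-registered continuous query discussed above (a sketch only — the topic name and columns are invented for illustration, not part of any agreed schema):

```sql
-- Hypothetical KSQL sketch: declare a stream over a Kafka topic of alerts.
CREATE STREAM alerts (objectId VARCHAR, ra DOUBLE, decl DOUBLE, magpsf DOUBLE)
  WITH (KAFKA_TOPIC = 'alerts', VALUE_FORMAT = 'JSON');

-- A continuously running query selecting bright detections into a new stream.
CREATE STREAM bright_alerts AS
  SELECT objectId, ra, decl, magpsf
  FROM alerts
  WHERE magpsf < 18.0;
```

Each such query runs continuously against the stream, which is why the number of queries that can be supported matters.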
Ken Smith, Making a Super Sherlock
Dave M asked if it was worth considering Kafka for transferring RA/Dec data to Sherlock.
Ken Smith, Cassandra vs. MySQL
Citus Data has produced a distributed relational database based on PostgreSQL, similar to Qserv.
Cam noted that a fair portion of the code is open source.
Also suggested CockroachDB is potentially interesting.
Cam A noted that if query needs change, then your data model (in Cassandra) needs to change, and there is a risk it is not easy to change it. This has tended to push people away from NoSQL and back to relational databases.
Cam A believes group-key indexing should be possible in a relational database.
Dave Y clarified that the partition key is the first component of the group key, and asked whether that is a concern, given that the telescope scanning across the sky would typically lead to an imbalance in load on the database for cross-matching.
Andy L asked what the problem is that Cassandra is trying to solve:
Ken believes intent for Cassandra is to distribute processing across multiple commodity nodes.
Ken noted that there is a replication problem, which is not solved.
Dave Y believes blob storage could help us tackle the scalability issues we have with MySQL.
Ken suggests we could store light curves in Cassandra, using Object ID as the primary key.
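A minimal sketch of what such a Cassandra table might look like (the table and column names are assumptions for illustration, not a settled design): the object ID as partition key keeps each light curve together on one node, with a clustering column ordering detections by epoch.

```sql
-- Hypothetical CQL sketch: one partition per object, rows clustered by time.
CREATE TABLE lightcurves (
    object_id text,    -- partition key: all detections for an object stored together
    mjd       double,  -- clustering column: detections ordered by epoch
    mag       double,
    magerr    double,
    PRIMARY KEY (object_id, mjd)
);

-- Fetching a full light curve is then a single-partition read:
SELECT mjd, mag, magerr FROM lightcurves WHERE object_id = 'example-object';
```

This is the pattern Cassandra favours: the data model is built around the known query (fetch a light curve by object ID), which is also why a later change in query needs can force a change in the data model.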
Gareth F, Storage Technologies
George asked if the performance issue with overwriting a file was a problem, given we have a write-once, read-many workload?
Gareth wasn’t sure. Light curves might need to be modified, though as they were large the overhead was proportionally smaller.
Nigel asked if there was a risk of transferring the same information multiple times, for continuously varying objects that alerted each time.
Stephen agreed this could be the case.
Nigel wondered if it would be possible to edit out the repeated data.
Gareth was concerned that de-duplicating that data created an implicit serialisation, as found for ZTF.
Ken questioned whether this was the case, given that subsequent detections contain only the 30-day forced photometry, so it may not be such a big deal.