apache-cassandra

Apache Cassandra is a free, open-source, distributed, wide-column store NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

Here are 114 public repositories matching this topic...

Provides a scaffold to easily build a cluster to query the data from ESA's Gaia satellite. Gaia is an ambitious mission to chart a three-dimensional map of our Galaxy, the Milky Way. Gaia will provide unprecedented positional and radial velocity measurements with the accuracies needed to produce a stereoscopic and kinematic census of about one b…

  • Updated Mar 12, 2017
  • Java

A project in which I applied data modelling concepts with Apache Cassandra and built an ETL pipeline using Python. To complete the project, a data model was defined by creating tables in Apache Cassandra to run queries. I am provided with part of the ETL pipeline that transfers data from a set of CSV files within a directory to create a s…

  • Updated Dec 15, 2021
  • Jupyter Notebook

This project implements Apache Cassandra data modelling to support Sparkify's analysis of user activity and song play data. It involves consolidating partitioned files into a single CSV, designing and creating tables based on specific queries from Sparkify's analytics team, and inserting the data from the CSV into the tables using CQL commands.

  • Updated Sep 10, 2024
  • Jupyter Notebook

Created by Apache Software Foundation

Released July 2008

Followers
127 followers
Repository
apache/cassandra
Website
cassandra.apache.org
Wikipedia
Wikipedia

Related Topics

dotnet language