Top #big-data Tools & Software
Explore 8 hand-picked tools and software tagged with big-data, ranked by popularity and community signals.
awesome-scalability
GitHub: The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
ClickHouse
GitHub: ClickHouse® is a real-time analytics database management system
data-science-ipython-notebooks
GitHub: Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
gun
GitHub: An open source cybersecurity protocol for syncing decentralized graph data.
trino
GitHub: Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
nebula
GitHub: A distributed, fast open-source graph database featuring horizontal scalability and high availability
starrocks
GitHub: The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
arkime
GitHub: Arkime is an open source, large scale, full packet capturing, indexing, and database system.