Top #hadoop Tools & Software
Explore 4 hand-picked tools and software tagged with hadoop — ranked by popularity and community signals.
seaweedfs
githubSeaweedFS is a distributed storage system for object storage (S3), file systems, and Iceberg tables, designed to handle billions of files with O(1) disk access and effortless horizontal scaling.
data-science-ipython-notebooks
githubData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
trino
githubOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
school-of-sre
githubAt LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.