Top #data-science Tools & Software

Explore 17 hand-picked tools and software projects tagged data-science, ranked by popularity and community signals.

scikit-learn

github

scikit-learn: machine learning in Python

AI Tools Python
★ 65,839

keras

github

Deep Learning for humans

AI Tools Python
★ 64,008

30-Days-Of-Python

github

The 30 Days of Python programming challenge is a step-by-step guide to learning the Python programming language in 30 days. The challenge may take more than 100 days, so follow your own pace. These videos may also help: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

Database Python
★ 61,147

pandas

github

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Developer Tools Python
★ 48,515

airflow

github

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Automation Python
★ 45,058

streamlit

github

Streamlit: a faster way to build and share data apps.

AI Tools Python
★ 44,231

gradio

github

Build and share delightful machine learning apps, all in Python.

AI Tools Python
★ 42,340

ray

github

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

AI Tools Python
★ 42,145

spaCy

github

💫 Industrial-strength Natural Language Processing (NLP) in Python

AI Tools Python
★ 33,472

ML-From-Scratch

github

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

AI Tools Python
★ 31,306

pytorch-lightning

github

Pretrain and finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

AI Tools Python
★ 31,051

data-science-ipython-notebooks

github

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

AI Tools Python
★ 29,002

d2l-en

github

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

AI Tools Python
★ 28,623

prefect

github

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Automation Python
★ 22,183

awesome-mlops

github

A curated list of references for MLOps

AI Tools
★ 13,854

RD-Agent

github

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. 🔗 https://aka.ms/RD-Agent-Tech-Report

AI Tools Python
★ 12,531

tpot

github

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

AI Tools
★ 10,041