Top #Science Tools & Software
Explore 26 hand-picked tools and software tagged with Science โ ranked by popularity and community signals.
developer-roadmap
githubInteractive roadmaps, guides and other educational content to help developers grow in their careers.
PaddleOCR
githubTurn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
awesome-scalability
githubThe Patterns of Scalable, Reliable, and Performant Large-Scale Systems
scikit-learn
githubscikit-learn: machine learning in Python
keras
githubDeep Learning for humans
30-Days-Of-Python
githubThe 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
MinerU
githubTransforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
professional-programming
githubA collection of learning resources for curious software engineers
pandas
githubFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
streamlit
githubStreamlit โ A faster way to build and share data apps.
gradio
githubBuild and share delightful machine learning apps, all in Python. ๐ Star to support our work!
ray
githubRay is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
spaCy
github๐ซ Industrial-strength Natural Language Processing (NLP) in Python
ML-From-Scratch
githubMachine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
pytorch-lightning
githubPretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
data-science-ipython-notebooks
githubData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
d2l-en
githubInteractive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
prefect
githubPrefect is a workflow orchestration framework for building resilient data pipelines in Python.
DevOps-Roadmap
githubDevOps Roadmap for 2026. with learning resources
taipy
githubTurns Data and AI algorithms into production-ready web applications in no time.
awesome-mlops
githubA curated list of references for MLOps
RD-Agent
githubResearch and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. ๐https://aka.ms/RD-Agent-Tech-Report
tpot
githubA Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
ml-intern
producthuntHugging Face's AI agent that automates post-training