Top #data Tools & Software

Explore 64 hand-picked tools and software tagged with data — ranked by popularity and community signals.

public-apis

github

A collective list of free APIs

Developer Tools Python
★ 423,594

openclaw

github

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

AI Tools TypeScript
★ 358,294

n8n

github

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

Automation TypeScript
★ 184,253

netdata

github

The fastest path to AI-powered full stack observability, even for lean teams.

DevOps C
★ 78,476

awesome-scalability

github

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

AI Tools
★ 70,422

scikit-learn

github

scikit-learn: machine learning in Python

AI Tools Python
★ 65,839

pathway

github

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Analytics Python
★ 63,508

30-Days-Of-Python

github

The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

Database Python
★ 61,147

MinerU

github

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Developer Tools Python
★ 60,035

memos

github

Open-source, self-hosted note-taking tool built for quick capture. Markdown-native, lightweight, and fully yours.

Productivity Go
★ 58,956

TrendRadar

github

⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测等。支持 Docker ,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。

AI Tools Python
★ 51,708

etcd

github

Distributed reliable key-value store for the most critical data of a distributed system

Database Go
★ 51,638

llama_index

github

LlamaIndex is the leading document agent and OCR platform

AI Tools Python
★ 48,624

pandas

github

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Developer Tools Python
★ 48,515

airflow

github

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Automation Python
★ 45,058

streamlit

github

Streamlit — A faster way to build and share data apps.

AI Tools Python
★ 44,231

gradio

github

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

AI Tools Python
★ 42,340

DeepSpeed

github

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

AI Tools Python
★ 42,123

ColossalAI

github

Making large AI models cheaper, faster and more accessible

AI Tools Python
★ 41,367

BettaFish

github

微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

AI Tools Python
★ 40,448

tidb

github

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Database Go
★ 39,976

quivr

github

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.

AI Tools Python
★ 39,104

mindsdb

github

Query Engine for AI Analytics: Build self-reasoning agents across all your live data

AI Tools Python
★ 38,988

Scrapling

github

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

AI Tools Python
★ 37,267