Files
awesome-python/README.md
Vinta Chen 0183bf15ae refactor: move entries to more accurate categories
- flower -> Task Queues (Celery sub-entry, not Admin Panels)
- vllm, rasa, diffusers, transformers -> AI Agents (not Deep Learning / ML)
- ccb -> AI Agents (not CLI Productivity Tools)
- code-graph-rag -> AI Agents (not Code Analysis)
- kafka-python -> Distributed Computing / Stream Processing (not NoSQL Databases)
- scapy -> Networking (not Hardware)
- sentry-python -> DevOps / Monitoring (not Logging)
- joblib -> Distributed Computing (not Job Scheduler)
- doit -> Build Tools (not Job Scheduler)
- karateclub -> Machine Learning (not Science)
- numba -> Science (not Python Implementations)
- diagrams -> Documentation (not Data Visualization)
- mkdocs -> Documentation (not Static Site Generators)
- pyelftools -> Text Formats / General (not Debugging)
- weasyprint -> Text Formats / PDF (not HTML/XML)
- webargs -> RESTful API (not URL Parsing)
- kafka-python stream sub-entry added, flower celery sub-entry added

Co-Authored-By: Claude <noreply@anthropic.com>
2026-03-20 17:46:54 +08:00

82 KiB
Raw Blame History

Awesome Python

An opinionated list of awesome Python frameworks, libraries, tools, software and resources.

The #10 most-starred repo on GitHub. Put your product where Python developers discover tools. Become a sponsor.

Categories


Admin Panels

Libraries for administrative interfaces.

  • ajenti - The admin panel your servers deserve.
  • django-grappelli - A jazzy skin for the Django Admin-Interface.
  • django-unfold - Elevate your Django admin with a stunning modern interface, powerful features, and seamless user experience.
  • flask-admin - Simple and extensible administrative interface framework for Flask.
  • func-to-web - Instantly create web UIs from Python functions using type hints. Zero frontend code required.
  • jet-bridge - Admin panel framework for any application with nice UI (ex Jet Django).
  • wooey - A Django app which creates automatic web UIs for Python scripts.

AI and Agents

Libraries for building AI applications, LLM integrations, and autonomous agents.

  • agno - Open-source Python library for building AI agents and agentic systems.
  • autogen - A programming framework for building agentic AI applications.
  • autoresearch - AI agents running autonomous research experiments on single-GPU LLM training.
  • crewai - A framework for orchestrating role-playing autonomous AI agents for collaborative task solving.
  • dspy - A framework for programming, not prompting, language models.
  • instructor - A library for extracting structured data from LLMs, powered by Pydantic.
  • langchain - Building applications with LLMs through composability.
  • llama_index - A data framework for your LLM application.
  • praisonai - Production-ready Multi-AI Agents framework with self-reflection, 100+ LLM support, MCP integration, and agentic workflows.
  • pydantic-ai - A Python agent framework for building generative AI applications with structured schemas.
  • ragflow - An open-source RAG engine for document understanding and question answering with LLMs.
  • ccb - A CLI tool to orchestrate multiple LLMs (Claude, Gemini, etc.) in Tmux panes with cross-agent interaction.
  • code-graph-rag - Builds knowledge graphs from multi-language codebases using Tree-sitter and Memgraph, enabling natural language querying of code structure.
  • diffusers - A library that provides pretrained diffusion models for generating and editing images, audio, and video.
  • rasa - An open-source machine learning framework for automated text and voice-based conversations.
  • smolagents - A lightweight library for building agents that reason and act through code.
  • transformers - A framework that lets you easily use pretrained transformer models for NLP, vision, and audio tasks.
  • vllm - A high-throughput and memory-efficient inference and serving engine for LLMs.

Algorithms and Design Patterns

Python implementation of data structures, algorithms and design patterns. Also see awesome-algorithms.

  • Algorithms
    • algorithms - Minimal examples of data structures and algorithms.
    • python-ds - A collection of data structure and algorithms for coding interviews.
    • sortedcontainers - Fast and pure-Python implementation of sorted collections.
    • thealgorithms - All Algorithms implemented in Python.
  • Design Patterns
    • python-cqrs - Event-Driven Architecture Framework with CQRS/CQS, Transaction Outbox, Saga orchestration.
    • python-patterns - A collection of design patterns in Python.
    • transitions - A lightweight, object-oriented finite state machine implementation.

Web Servers

ASGI and WSGI compatible web servers.

  • ASGI
    • daphne - A HTTP, HTTP2 and WebSocket protocol server for ASGI and ASGI-HTTP.
    • granian - A Rust HTTP server for Python applications built on top of Hyper and Tokio, supporting WSGI/ASGI/RSGI.
    • hypercorn - An ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.
    • uvicorn - A lightning-fast ASGI server implementation, using uvloop and httptools.
  • WSGI
    • gunicorn - Pre-forked, ported from Ruby's Unicorn project.
    • uwsgi - A project aims at developing a full stack for building hosting services, written in C.
    • waitress - Multi-threaded, powers Pyramid.
    • werkzeug - A WSGI utility library for Python that powers Flask and can easily be embedded into your own projects.

Asynchronous Programming

Libraries for asynchronous, concurrent and parallel execution. Also see awesome-asyncio.

  • asyncio - (Python standard library) Asynchronous I/O, event loop, coroutines and tasks.
  • concurrent.futures - (Python standard library) A high-level interface for asynchronously executing callables.
  • gevent - A coroutine-based Python networking library that uses greenlet.
  • multiprocessing - (Python standard library) Process-based parallelism.
  • trio - A friendly library for async concurrency and I/O.
  • twisted - An event-driven networking engine.
  • uvloop - Ultra fast asyncio event loop.

Audio

Libraries for manipulating audio and its metadata.

  • Audio
    • audioflux - A library for audio and music analysis, feature extraction.
    • audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
    • dejavu - Audio fingerprinting and recognition.
    • gtts - Python library and CLI tool for converting text to speech using Google Translate TTS.
    • kapre - Keras Audio Preprocessors.
    • librosa - Python library for audio and music analysis.
    • matchering - A library for automated reference audio mastering.
    • mingus - An advanced music theory and notation package with MIDI file and playback support.
    • pyAudioAnalysis - Audio feature extraction, classification, segmentation and applications.
    • pydub - Manipulate audio with a simple and easy high level interface.
    • TimeSide - Open web audio processing framework.
  • Metadata
    • beets - A music library manager and MusicBrainz tagger.
    • eyed3 - A tool for working with audio files, specifically MP3 files containing ID3 metadata.
    • mutagen - A Python module to handle audio metadata.
    • tinytag - A library for reading music meta data of MP3, OGG, FLAC and Wave files.

Authentication

Libraries for implementing authentication schemes.

  • OAuth
    • authlib - JavaScript Object Signing and Encryption draft implementation.
    • django-allauth - Authentication app for Django that "just works."
    • django-oauth-toolkit - OAuth 2 goodies for Django.
    • oauthlib - A generic and thorough implementation of the OAuth request-signing logic.
  • JWT
    • pyjwt - JSON Web Token implementation in Python.
    • python-jose - A JOSE implementation in Python.
  • Permissions
    • django-guardian - Implementation of per object permissions for Django 1.2+
    • django-rules - A tiny but powerful app providing object-level permissions to Django, without requiring a database.

Build Tools

Compile software from source code.

  • bitbake - A make-like build tool for embedded Linux.
  • buildout - A build system for creating, assembling and deploying applications from multiple parts.
  • invoke - A tool for managing shell-oriented subprocesses and organizing executable Python code into CLI-invokable tasks.
  • platformio - A console tool to build code with different development platforms.
  • pybuilder - A continuous build tool written in pure Python.
  • doit - A task runner and build tool.
  • scons - A software construction tool.

Built-in Classes Enhancement

Libraries for enhancing Python built-in classes.

  • attrs - Replacement for __init__, __eq__, __repr__, etc. boilerplate in class definitions.
  • bidict - Efficient, Pythonic bidirectional map data structures and related functionality.
  • box - Python dictionaries with advanced dot notation access.

CMS

Content Management Systems.

  • django-cms - The easy-to-use and developer-friendly enterprise CMS powered by Django.
  • feincms - One of the most advanced Content Management Systems built on Django.
  • indico - A feature-rich event management system, made @ CERN.
  • wagtail - A Django content management system.

Caching

Libraries for caching data.

  • beaker - A WSGI middleware for sessions and caching.
  • django-cache-machine - Automatic caching and invalidation for Django models.
  • django-cacheops - A slick ORM cache with automatic granular event-driven invalidation.
  • dogpile.cache - dogpile.cache is a next generation replacement for Beaker made by the same authors.
  • hermescache - Python caching library with tag-based invalidation and dogpile effect prevention.
  • pylibmc - A Python wrapper around the libmemcached interface.
  • python-diskcache - SQLite and file backed cache backend with faster lookups than memcached and redis.

Code Analysis

Tools of static analysis, linters and code quality checkers. Also see awesome-static-analysis.

  • Code Analysis
    • code2flow - Turn your Python and JavaScript code into DOT flowcharts.
    • prospector - A tool to analyze Python code.
    • vulture - A tool for finding and analyzing dead Python code.
  • Code Linters
    • bandit - A tool designed to find common security issues in Python code.
    • flake8 - A wrapper around pycodestyle, pyflakes and McCabe.
    • pylint - A fully customizable source code analyzer.
    • ruff - An extremely fast Python linter and code formatter.
  • Code Formatters
    • black - The uncompromising Python code formatter.
    • isort - A Python utility / library to sort imports.
    • yapf - Yet another Python code formatter from Google.
  • Static Type Checkers, also see awesome-python-typing
    • mypy - Check variable types during compile time.
    • pyre-check - Performant type checking.
    • ty - An extremely fast Python type checker and language server.
    • typeshed - Collection of library stubs for Python, with static types.
  • Refactoring
    • bowler - Safe code refactoring for modern Python.
    • rope - Rope is a python refactoring library.
  • Static Type Annotations Generators
    • monkeytype - A system for Python that generates static type annotations by collecting runtime types.
    • pytype - Pytype checks and infers types for Python code - without requiring type annotations.

Command-line Interface Development

Libraries for building command-line applications.

  • Command-line Application Development
    • argparse - (Python standard library) Command-line option and argument parsing.
    • cement - CLI Application Framework for Python.
    • click - A package for creating beautiful command line interfaces in a composable way.
    • cliff - A framework for creating command-line programs with multi-level commands.
    • python-fire - A library for creating command line interfaces from absolutely any Python object.
    • python-prompt-toolkit - A library for building powerful interactive command lines.
    • typer - Modern CLI framework that uses Python type hints. Built on Click and Pydantic.
  • Terminal Rendering
    • alive-progress - A new kind of Progress Bar, with real-time throughput, eta and very cool animations.
    • asciimatics - A package to create full-screen text UIs (from interactive forms to ASCII animations).
    • bashplotlib - Making basic plots in the terminal.
    • colorama - Cross-platform colored terminal text.
    • rich - Python library for rich text and beautiful formatting in the terminal. Also provides a great RichHandler log handler.
    • textual - A framework for building interactive user interfaces that run in the terminal and the browser.
    • tqdm - Fast, extensible progress bar for loops and CLI.

Command-line Tools

Useful CLI-based tools for productivity.

  • Productivity Tools
    • cookiecutter - A command-line utility that creates projects from cookiecutters (project templates).
    • copier - A library and command-line utility for rendering projects templates.
    • doitlive - A tool for live presentations in the terminal.
    • fpp - Select files out of bash output.
    • thefuck - Correcting your previous console command.
    • tmuxp - A tmux session manager.
    • xonsh - A Python-powered shell. Full-featured and cross-platform.
  • CLI Enhancements
    • httpie - A command line HTTP client, a user-friendly cURL replacement.
    • iredis - Redis CLI with autocompletion and syntax highlighting.
    • litecli - SQLite CLI with autocompletion and syntax highlighting.
    • mycli - MySQL CLI with autocompletion and syntax highlighting.
    • pgcli - PostgreSQL CLI with autocompletion and syntax highlighting.

Computer Vision

Libraries for Computer Vision.

  • easyocr - Ready-to-use OCR with 40+ languages supported.
  • kornia - Open Source Differentiable Computer Vision Library for PyTorch.
  • opencv - Open Source Computer Vision Library.
  • pytesseract - A wrapper for Google Tesseract OCR.
  • tesserocr - Another simple, Pillow-friendly, wrapper around the tesseract-ocr API for OCR.

Configuration Files

Libraries for storing and parsing configuration options.

  • configobj - INI file parser with validation.
  • configparser - (Python standard library) INI file parser.
  • dynaconf - Dynaconf is a configuration manager with plugins for Django, Flask and FastAPI.
  • hydra - Hydra is a framework for elegantly configuring complex applications.
  • python-decouple - Strict separation of settings from code.

Cryptography

  • cryptography - A package designed to expose cryptographic primitives and recipes to Python developers.
  • paramiko - The leading native Python SSHv2 protocol library.
  • pynacl - Python binding to the Networking and Cryptography (NaCl) library.

Data Analysis

Libraries for data analysis.

  • aws-sdk-pandas - Pandas on AWS.
  • datasette - An open source multi-tool for exploring and publishing data.
  • desbordante - An open source data profiler for complex pattern discovery.
  • optimus - Agile Data Science Workflows made easy with PySpark.
  • pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
  • pathway - Real-time data processing framework for Python with reactive dataflows.
  • polars - A fast DataFrame library implemented in Rust with a Python API.

Data Validation

Libraries for validating data. Used for forms in many cases.

  • cerberus - A lightweight and extensible data validation library.
  • colander - Validating and deserializing data obtained via XML, JSON, an HTML form post.
  • jsonschema - An implementation of JSON Schema for Python.
  • pydantic - Data validation using Python type hints.
  • schema - A library for validating Python data structures.
  • schematics - Data Structure Validation.
  • voluptuous - A Python data validation library.

Data Visualization

Libraries for visualizing data. Also see awesome-javascript.

  • altair - Declarative statistical visualization library for Python.
  • bokeh - Interactive Web Plotting for Python.
  • bqplot - Interactive Plotting Library for the Jupyter Notebook.
  • cartopy - A cartographic python library with matplotlib support.
  • gradio - Build and share machine learning apps, all in Python.
  • matplotlib - A Python 2D plotting library.
  • plotly - Interactive graphing library for Python.
  • plotnine - A grammar of graphics for Python based on ggplot2.
  • pygal - A Python SVG Charts Creator.
  • pygraphviz - Python interface to Graphviz.
  • pyqtgraph - Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.
  • seaborn - Statistical data visualization using Matplotlib.
  • streamlit - A framework which lets you build dashboards, generate reports, or create chat apps in minutes.
  • ultraplot - Matplotlib wrapper for publication-ready scientific figures with minimal code. Includes advanced subplot management, panel layouts, and batteries-included geoscience plotting.
  • vispy - High-performance scientific visualization based on OpenGL.

Database

Databases implemented in Python.

  • chromadb - An open-source embedding database for building AI applications with embeddings and semantic search.
  • duckdb - An in-process SQL OLAP database management system; optimized for analytics and fast queries, similar to SQLite but for analytical workloads.
  • pickledb - A simple and lightweight key-value store for Python.
  • tinydb - A tiny, document-oriented database.
  • ZODB - A native object database for Python. A key-value and object graph database.

Database Drivers

Libraries for connecting and operating databases.

Date and Time

Libraries for working with dates and times.

  • arrow - A Python library that offers a sensible and human-friendly approach to creating, manipulating, formatting and converting dates, times and timestamps.
  • dateutil - Extensions to the standard Python datetime module.
  • pendulum - Python datetimes made easy.
  • zoneinfo - (Python standard library) IANA time zone support. Brings the tz database into Python.

Debugging Tools

Libraries for debugging code.

  • pdb-like Debugger
    • ipdb - IPython-enabled pdb.
    • pudb - A full-screen, console-based Python debugger.
  • Tracing
    • manhole - Debugging UNIX socket connections and present the stacktraces for all threads and an interactive prompt.
    • python-hunter - A flexible code tracing toolkit.
  • Profiler
    • py-spy - A sampling profiler for Python programs. Written in Rust.
    • vprof - Visual Python profiler.
  • Others
    • django-debug-toolbar - Display various debug information for Django.
    • flask-debugtoolbar - A port of the django-debug-toolbar to flask.
    • icecream - Inspect variables, expressions, and program execution with a single, simple function call.
    • memory_graph - Visualize Python data at runtime to debug references, mutability, and aliasing.

Deep Learning

Frameworks for Neural Networks and Deep Learning. Also see awesome-deep-learning.

  • jax - a library for high-performance numerical computing with automatic differentiation and JIT compilation.
  • keras - A high-level neural networks library and capable of running on top of either TensorFlow or Theano.
  • pytorch-lightning - Deep learning framework to train, deploy, and ship AI products Lightning fast.
  • pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration.
  • stable-baselines3 - PyTorch implementations of Stable Baselines (deep) reinforcement learning algorithms.
  • tensorflow - The most popular Deep Learning framework created by Google.

DevOps Tools

Software and libraries for DevOps.

  • Cloud Providers
    • boto3 - Python interface to Amazon Web Services.
  • Configuration Management
    • ansible - A radically simple IT automation platform.
    • cloudinit - A multi-distribution package that handles early initialization of a cloud instance.
    • openstack - Open source software for building private and public clouds.
    • pyinfra - A versatile CLI tools and python libraries to automate infrastructure.
    • saltstack - Infrastructure automation and management system.
  • SSH-style Deployment
    • fabric - A simple, Pythonic tool for remote execution and deployment.
  • Process Management
    • supervisor - Supervisor process control system for UNIX.
  • Monitoring
    • psutil - A cross-platform process and system utilities module.
    • sentry-python - Sentry SDK for Python.
  • Git Hooks
    • pre-commit - A framework for managing and maintaining multi-language pre-commit hooks.
  • Backup
    • borg - A deduplicating archiver with compression and encryption.
  • Serverless
    • python-lambda - A toolkit for developing and deploying Python code in AWS Lambda.
    • zappa - A tool for deploying WSGI applications on AWS Lambda and API Gateway.
  • Chaos Engineering
    • chaostoolkit - A Chaos Engineering toolkit & Orchestration for Developers.

Distributed Computing

Frameworks and libraries for Distributed Computing.

  • Batch Processing
    • dask - A flexible parallel computing library for analytic computing.
    • luigi - A module that helps you build complex pipelines of batch jobs.
    • mpi4py - Python bindings for MPI.
    • pyspark - Apache Spark Python API.
    • joblib - A set of tools to provide lightweight pipelining in Python.
    • ray - A system for parallel and distributed Python that unifies the machine learning ecosystem.
  • Stream Processing

Distribution

Libraries to create packaged executables for release distribution.

  • cx-Freeze - It is a Python tool that converts Python scripts into standalone executables and installers for Windows, macOS, and Linux.
  • Nuitka - Compiles Python programs into high-performance standalone executables (cross-platform, supports all Python versions).
  • py2app - Freezes Python scripts (Mac OS X).
  • py2exe - Freezes Python scripts (Windows).
  • pyarmor - A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.
  • pyinstaller - Converts Python programs into stand-alone executables (cross-platform).
  • shiv - A command line utility for building fully self-contained zipapps (PEP 441), but with all their dependencies included.

Documentation

Libraries for generating project documentation.

  • sphinx - Python Documentation generator.
  • diagrams - Diagram as Code.
  • mkdocs - Markdown friendly documentation generator.
  • pdoc - Epydoc replacement to auto generate API documentation for Python libraries.

Downloader

Libraries for downloading.

  • General
    • s3cmd - A command line tool for managing Amazon S3 and CloudFront.
    • youtube-dl - A command-line program to download videos from YouTube and other video sites.
  • Financial Data
    • akshare - A financial data interface library, built for human beings!
    • edgartools - Library for downloading structured data from SEC EDGAR filings and XBRL financial statements.
    • openbb - A financial data platform for analysts, quants and AI agents.
    • yfinance - Easy Pythonic way to download market and financial data from Yahoo Finance.

Editor Plugins and IDEs

  • Emacs
    • elpy - Emacs Python Development Environment.
  • Vim
    • jedi-vim - Vim bindings for the Jedi auto-completion library for Python.
    • python-mode - An all in one plugin for turning Vim into a Python IDE.
    • YouCompleteMe - Includes Jedi-based completion engine for Python.
  • Visual Studio
    • PTVS - Python Tools for Visual Studio.
  • Visual Studio Code
  • IDE
    • PyCharm - Commercial Python IDE by JetBrains. Has free community edition available.
    • spyder - Open Source Python IDE.

Email

Libraries for sending and parsing email.

  • Mail Servers
    • modoboa - A mail hosting and management platform including a modern Web UI.
  • Clients
    • imbox - Python IMAP for Humans.
    • yagmail - Yet another Gmail/SMTP client.
  • Others
    • flanker - An email address and Mime parsing library.
    • mailer - High-performance extensible mail delivery framework.

Environment Management

Libraries for Python version and virtual environment management.

  • pyenv - Simple Python version management.
    • pyenv-win - Pyenv for Windows, Simple Python version management.
  • uv - An extremely fast Python version, package and project manager, written in Rust.
  • virtualenv - A tool to create isolated Python environments.

File Manipulation

Libraries for file manipulation.

  • mimetypes - (Python standard library) Map filenames to MIME types.
  • path.py - A module wrapper for os.path.
  • pathlib - (Python standard library) A cross-platform, object-oriented path library.
  • python-magic - A Python interface to the libmagic file type identification library.
  • watchdog - API and shell utilities to monitor file system events.
  • watchfiles - Simple, modern and fast file watching and code reload in python.

Functional Programming

Functional Programming with Python.

  • coconut - A variant of Python built for simple, elegant, Pythonic functional programming.
  • cytoolz - Cython implementation of Toolz: High performance functional utilities.
  • functools - (Python standard library) Higher-order functions and operations on callable objects.
  • funcy - A fancy and practical functional tools.
  • more-itertools - More routines for operating on iterables, beyond itertools.
  • returns - A set of type-safe monads, transformers, and composition utilities.
  • toolz - A collection of functional utilities for iterators, functions, and dictionaries.

GUI Development

Libraries for working with graphical user interface applications.

  • curses - Built-in wrapper for ncurses used to create terminal GUI applications.
  • customtkinter - A modern and customizable python UI-library based on Tkinter.
  • dearpygui - A Simple GPU accelerated Python GUI framework
  • enaml - Creating beautiful user-interfaces with Declarative Syntax like QML.
  • flet - Cross-platform GUI framework for building modern apps in pure Python.
  • flexx - Flexx is a pure Python toolkit for creating GUI's, that uses web technology for its rendering.
  • gooey - Turn command line programs into a full GUI application with one line.
  • kivy - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.
  • nicegui - An easy-to-use, Python-based UI framework, which shows up in your web browser.
  • pyglet - A cross-platform windowing and multimedia library for Python.
  • pygobject - Python Bindings for GLib/GObject/GIO/GTK+ (GTK+3).
  • PyQt - Python bindings for the Qt cross-platform application and UI framework.
  • pyside - Qt for Python offers the official Python bindings for Qt, this is same as PyQt but it's the official binding with different licensing.
  • pywebview - A lightweight cross-platform native wrapper around a webview component.
  • tkinter - (Python standard library) The standard Python interface to the Tcl/Tk GUI toolkit.
  • toga - A Python native, OS native GUI toolkit.
  • urwid - A library for creating terminal GUI applications with strong support for widgets, events, rich colors, etc.
  • wxPython - A blending of the wxWidgets C++ class library with the Python.

Game Development

Awesome game development libraries.

  • arcade - Arcade is a modern Python framework for crafting games with compelling graphics and sound.
  • cocos - A framework for building 2D games, demos, and other graphical/interactive applications.
  • harfang3d - Python framework for 3D, VR and game development.
  • panda3d - 3D game engine developed by Disney.
  • py-sdl2 - A ctypes based wrapper for the SDL2 library.
  • pygame - Pygame is a set of Python modules designed for writing games.
  • pyopengl - Python ctypes bindings for OpenGL and it's related APIs.
  • renpy - A Visual Novel engine.

Geolocation

Libraries for geocoding addresses and working with latitudes and longitudes.

  • django-countries - A Django app that provides a country field for models and forms.
  • geodjango - A world-class geographic web framework.
  • geojson - Python bindings and utilities for GeoJSON.
  • geopandas - Python tools for geographic data (GeoSeries/GeoDataFrame) built on pandas.
  • geopy - Python Geocoding Toolbox.

HTML Manipulation

Libraries for working with HTML and XML.

  • beautifulsoup - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
  • cssutils - A CSS library for Python.
  • html5lib - A standards-compliant library for parsing and serializing HTML documents and fragments.
  • justhtml - A pure Python HTML5 parser that just works.
  • lxml - A very fast, easy-to-use and versatile library for handling HTML and XML.
  • markupsafe - Implements a XML/HTML/XHTML Markup safe string for Python.
  • pyquery - A jQuery-like library for parsing HTML.
  • untangle - Converts XML documents to Python objects for easy access.
  • xmldataset - Simple XML Parsing.
  • xmltodict - Working with XML feel like you are working with JSON.

HTTP Clients

Libraries for working with HTTP.

  • aiohttp - Asynchronous HTTP client/server framework for asyncio and Python.
  • httpx - A next generation HTTP client for Python.
  • requests - HTTP Requests for Humans.
  • treq - Python requests like API built on top of Twisted's HTTP client.
  • urllib3 - A HTTP library with thread-safe connection pooling, file post support, sanity friendly.

Hardware

Libraries for programming with hardware.

  • bleak - A cross platform Bluetooth Low Energy Client for Python using asyncio.
  • mouse - Hook and simulate global mouse events on Windows and Linux.
  • pynput - A library to control and monitor input devices.

Image Processing

Libraries for manipulating images.

  • pillow - Pillow is the friendly PIL fork.
  • pymatting - A library for alpha matting.
  • python-barcode - Create barcodes in Python with no extra dependencies.
  • python-qrcode - A pure Python QR Code generator.
  • pyvips - A fast image processing library with low memory needs.
  • quads - Computer art based on quadtrees.
  • scikit-image - A Python library for (scientific) image processing.
  • thumbor - A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.
  • wand - Python bindings for MagickWand, C API for ImageMagick.

Implementations

Implementations of Python.

  • clpython - Implementation of the Python programming language written in Common Lisp.
  • cpython - Default, most widely used implementation of the Python programming language written in C.
  • cython - Optimizing Static Compiler for Python.
  • ironpython - Implementation of the Python programming language written in C#.
  • micropython - A lean and efficient Python programming language implementation.
  • PeachPy - x86-64 assembler embedded in Python.
  • pypy - A very fast and compliant implementation of the Python language.
  • pyston - A Python implementation using JIT techniques.

Interactive Interpreter

Interactive Python interpreters (REPL).

  • bpython - A fancy interface to the Python interpreter.
  • jupyter - A rich toolkit to help you make the most out of using Python interactively.
  • marimo - Transform data and train models, feels like a next-gen notebook, stored as Git-friendly Python.
  • ptpython - Advanced Python REPL built on top of the python-prompt-toolkit.

Job Scheduler

Libraries for scheduling jobs.

  • airflow - Airflow is a platform to programmatically author, schedule and monitor workflows.
  • apscheduler - A light but powerful in-process task scheduler that lets you schedule functions.
  • dagster - An orchestration platform for the development, production, and observation of data assets.
  • prefect - A modern workflow orchestration framework that makes it easy to build, schedule and monitor robust data pipelines.
  • schedule - Python job scheduling for humans.
  • SpiffWorkflow - A powerful workflow engine implemented in pure Python.
  • taskflow - A Python library that helps to make task execution easy, consistent and reliable.

Logging

Libraries for generating and working with logs.

  • logbook - Logging replacement for Python.
  • logging - (Python standard library) Logging facility for Python.
  • loguru - Library which aims to bring enjoyable logging in Python.
  • structlog - Structured logging made easy.

Machine Learning

Libraries for Machine Learning. Also see awesome-machine-learning.

  • catboost - A fast, scalable, high performance gradient boosting on decision trees library.
  • feature_engine - sklearn compatible API with the widest toolset for feature engineering and selection.
  • karateclub - Unsupervised machine learning toolbox for graph structured data.
  • h2o - Open Source Fast Scalable Machine Learning Platform.
  • lightgbm - A fast, distributed, high performance gradient boosting framework.
  • mindsdb - MindsDB is an open source AI layer for existing databases that allows you to effortlessly develop, train and deploy state-of-the-art machine learning models using standard queries.
  • pgmpy - A Python library for probabilistic graphical models and Bayesian networks.
  • scikit-learn - The most popular Python library for Machine Learning with extensive documentation and community support.
  • spark.ml - Apache Spark's scalable Machine Learning library for distributed computing.
  • xgboost - A scalable, portable, and distributed gradient boosting library.

Microsoft Windows

Python programming on Microsoft Windows.

  • pythonnet - Python Integration with the .NET Common Language Runtime (CLR).
  • pywin32 - Python Extensions for Windows.
  • winpython - Portable development environment for Windows 10/11.

Miscellaneous

Useful libraries or tools that don't fit in the categories above.

  • blinker - A fast Python in-process signal/event dispatching system.
  • boltons - A set of pure-Python utilities.
  • itsdangerous - Various helpers to pass trusted data to untrusted environments.
  • pluginbase - A simple but flexible plugin system for Python.
  • tryton - A general-purpose business framework.

Natural Language Processing

Libraries for working with human languages.

  • General
    • gensim - Topic Modeling for Humans.
    • langid.py - Stand-alone language identification system.
    • nltk - A leading platform for building Python programs to work with human language data.
    • pattern - A web mining module.
    • polyglot - Natural language pipeline supporting hundreds of languages.
    • spacy - A library for industrial-strength natural language processing in Python and Cython.
    • stanza - The Stanford NLP Group's official Python library, supporting 60+ languages.
  • Chinese
    • funnlp - A collection of tools and datasets for Chinese NLP.
    • jieba - The most popular Chinese text segmentation library.
    • pkuseg-python - A toolkit for Chinese word segmentation in various domains.
    • snownlp - A library for processing Chinese text.

Network Virtualization

Tools and libraries for Virtual Networking and SDN (Software Defined Networking).

  • mininet - A popular network emulator and API written in Python.
  • napalm - Cross-vendor API to manipulate network devices.
  • pox - A Python-based SDN control applications, such as OpenFlow SDN controllers.
  • scapy - A brilliant packet manipulation library.

ORM

Libraries that implement Object-Relational Mapping or data mapping techniques.

  • Relational Databases
    • django.db.models - The Django ORM.
    • sqlalchemy - The Python SQL Toolkit and Object Relational Mapper.
    • dataset - Store Python dicts in a database - works with SQLite, MySQL, and PostgreSQL.
    • peewee - A small, expressive ORM.
    • pony - ORM that provides a generator-oriented interface to SQL.
    • sqlmodel - SQLModel is based on Python type annotations, and powered by Pydantic and SQLAlchemy.
    • tortoise-orm - An easy-to-use asyncio ORM inspired by Django, with relations support.
  • NoSQL Databases
    • beanie - An asynchronous Python object-document mapper (ODM) for MongoDB.
    • mongoengine - A Python Object-Document-Mapper for working with MongoDB.
    • odmantic - Sync and Async ODM for MongoDB built on top of Pydantic for model definition and validation.
    • pynamodb - A Pythonic interface for Amazon DynamoDB.

Package Management

Libraries for package and dependency management.

  • conda - Cross-platform, Python-agnostic binary package manager.
  • mamba - A reimplementation of the conda package manager in C++.
  • pip - The package installer for Python.
  • pipx - Install and Run Python Applications in Isolated Environments. Like npx in Node.js.
  • poetry - Python dependency management and packaging made easy.
  • uv - An extremely fast Python version, package and project manager, written in Rust.

Package Repositories

Local PyPI repository server and proxies.

  • bandersnatch - PyPI mirroring tool provided by Python Packaging Authority (PyPA).
  • devpi - PyPI server and packaging/testing/release tool.
  • warehouse - Next generation Python Package Repository (PyPI).

Penetration Testing

Frameworks and tools for penetration testing.

  • fsociety - A Penetration testing framework.
  • setoolkit - A toolkit for social engineering.
  • sherlock - Hunt down social media accounts by username across social networks.
  • sqlmap - Automatic SQL injection and database takeover tool.

Processes

Libraries for starting and communicating with OS processes.

Quantum Computing

Libraries for quantum computing.

  • Cirq — A Google-developed framework focused on hardware-aware quantum circuit design for NISQ devices.
  • pennylane — A hybrid quantum-classical machine learning library with automatic differentiation support.
  • qiskit — An IBM-backed quantum SDK for building, simulating, and running circuits on real quantum hardware.
  • qutip - Quantum Toolbox in Python.

Recommender Systems

Libraries for building recommender systems.

  • annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage.
  • implicit - A fast Python implementation of collaborative filtering for implicit datasets.
  • lightfm - A Python implementation of a number of popular recommendation algorithms.
  • scikit-surprise - A scikit for building and analyzing recommender systems.
  • spotlight - Deep recommender models using PyTorch.

Web APIs

Libraries for building RESTful and GraphQL APIs.

  • Django
  • Flask
    • eve - REST API framework powered by Flask, MongoDB and good intentions.
    • flask-api - Browsable Web APIs for Flask.
    • flask-restful - Quickly building REST APIs for Flask.
  • Pyramid
    • cornice - A RESTful framework for Pyramid.
  • Framework Agnostic
    • falcon - A high-performance framework for building cloud APIs and web app backends.
    • fastapi - A modern, fast, web framework for building APIs with Python 3.6+ based on standard Python type hints.
    • graphene - GraphQL framework for Python.
    • hug - A Python 3 framework for cleanly exposing APIs.
    • sandman2 - Automated REST APIs for existing database-driven systems.
    • sanic - A Python 3.6+ web server and web framework that's written to go fast.
    • webargs - A friendly library for parsing HTTP request arguments with built-in support for popular web frameworks.

RPC Servers

RPC-compatible servers.

  • rpyc (Remote Python Call) - A transparent and symmetric RPC library for Python
  • zerorpc - zerorpc is a flexible RPC implementation based on ZeroMQ and MessagePack.

Science

Libraries for scientific computing. Also see Python-for-Scientists.

  • astropy - A community Python library for Astronomy.
  • bcbb - Collection of useful code related to biological analysis.
  • biopython - Biopython is a set of freely available tools for biological computation.
  • cclib - A library for parsing and interpreting the results of computational chemistry packages.
  • colour - Implementing a comprehensive number of colour theory transformations and algorithms.
  • manim - An animation engine for explanatory math videos.
  • networkx - A high-productivity software for complex networks.
  • numba - Python JIT compiler to LLVM aimed at scientific Python.
  • nipy - A collection of neuroimaging toolkits.
  • numpy - A fundamental package for scientific computing with Python.
  • obspy - A Python toolbox for seismology.
  • openbabel - A chemical toolbox designed to speak the many languages of chemical data.
  • pathsim - A block-based system modeling and simulation framework with a browser-based visual editor.
  • pydy - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion.
  • pymc - Markov Chain Monte Carlo sampling toolkit.
  • PythonRobotics - This is a compilation of various robotics algorithms with visualizations.
  • rdkit - Cheminformatics and Machine Learning Software.
  • scipy - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
  • shapely - Manipulation and analysis of geometric objects in the Cartesian plane.
  • simpy - A process-based discrete-event simulation framework.
  • statsmodels - Statistical modeling and econometrics in Python.
  • sympy - A Python library for symbolic mathematics.

Libraries and software for indexing and performing search queries on data.

Serialization

Libraries for serializing complex data types.

  • marshmallow - A lightweight library for converting complex objects to and from simple Python datatypes.
  • msgpack - MessagePack serializer implementation for Python.
  • orjson - Fast, correct JSON library.
  • pysimdjson - A Python bindings for simdjson.
  • python-rapidjson - A Python wrapper around RapidJSON.
  • ultrajson - A fast JSON decoder and encoder written in C with Python bindings.

Specific Formats Processing

Libraries for parsing and manipulating specific text formats.

  • General
    • docling - Library for converting documents into structured data.
    • kreuzberg - High-performance document extraction library with a Rust core, supporting 62+ formats including PDF, Office, images with OCR, HTML, email, and archives.
    • pyelftools - Parsing and analyzing ELF files and DWARF debugging information.
    • tablib - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
    • textract - Extract text from any document, Word, PowerPoint, PDFs, etc.
  • Office
    • docxtpl - Editing a docx document by jinja2 template
    • openpyxl - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
    • pyexcel - Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
    • python-docx - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
    • python-pptx - Python library for creating and updating PowerPoint (.pptx) files.
    • xlsxwriter - A Python module for creating Excel .xlsx files.
    • xlwings - A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
  • PDF
    • pdf_oxide - A fast PDF library for text extraction, image extraction, and markdown conversion, powered by Rust.
    • pdfminer.six - Pdfminer.six is a community maintained fork of the original PDFMiner.
    • pikepdf - A powerful library for reading and editing PDF files, based on qpdf.
    • PyPDF2 - A library capable of splitting, merging and transforming PDF pages.
    • reportlab - Allowing Rapid creation of rich PDF documents.
    • weasyprint - A visual rendering engine for HTML and CSS that can export to PDF.
  • Markdown
    • markdown-it-py - Markdown parser with 100% CommonMark support, extensions, and syntax plugins.
    • markdown - A Python implementation of John Grubers Markdown.
    • markitdown - Python tool for converting files and office documents to Markdown.
    • mistune - Fastest and full featured pure Python parsers of Markdown.
  • YAML
    • pyyaml - YAML implementations for Python.
  • TOML
    • tomllib - (Python standard library) Parse TOML files.
  • CSV
    • csvkit - Utilities for converting to and working with CSV.
  • Archive
    • unp - A command line tool that can unpack archives easily.

Static Site Generator

Static site generator is a software that takes some text + templates as input and produces HTML files on the output.

  • lektor - An easy to use static CMS and blog engine.
  • makesite - Simple, lightweight, and magic-free static site/blog generator (< 130 lines).
  • nikola - A static website and blog generator.
  • pelican - Static site generator that supports Markdown and reST syntax.

Task Queues

Libraries for working with task queues.

  • celery - An asynchronous task queue/job queue based on distributed message passing.
    • flower - Real-time monitor and web admin for Celery.
  • dramatiq - A fast and reliable background task processing library for Python 3.
  • huey - Little multi-threaded task queue.
  • mrq - A distributed worker task queue in Python using Redis & gevent.
  • rq - Simple job queues for Python.

Template Engine

Libraries and tools for templating and lexing.

  • genshi - Python templating toolkit for generation of web-aware output.
  • jinja - A modern and designer friendly templating language.
  • mako - Hyperfast and lightweight templating for the Python platform.

Testing

Libraries for testing codebases and generating test data.

  • Testing Frameworks
    • hypothesis - Hypothesis is an advanced Quickcheck style property based testing library.
    • nose2 - The successor to nose, based on unittest2.
    • pytest - A mature full-featured Python testing tool.
    • robotframework - A generic test automation framework.
    • scanapi - Automated Testing and Documentation for your REST API.
    • unittest - (Python standard library) Unit testing framework.
  • Test Runners
    • green - A clean, colorful test runner.
    • nox - Flexible test automation for Python.
    • tox - Auto builds and tests distributions in multiple Python versions
  • GUI / Web Testing
    • locust - Scalable user load testing tool written in Python.
    • playwright - Python version of the Playwright testing and automation library.
    • pyautogui - PyAutoGUI is a cross-platform GUI automation Python module for human beings.
    • schemathesis - A tool for automatic property-based testing of web applications built with Open API / Swagger specifications.
    • selenium - Python bindings for Selenium WebDriver.
    • splinter - Open source tool for testing web applications.
  • Mock
    • doublex - Powerful test doubles framework for Python.
    • freezegun - Travel through time by mocking the datetime module.
    • httmock - A mocking library for requests for Python 2.6+ and 3.2+.
    • httpretty - HTTP request mock tool for Python.
    • mock - (Python standard library) A mocking and patching library.
    • mocket - A socket mock framework with gevent/asyncio/SSL support.
    • responses - A utility library for mocking out the requests Python library.
    • vcrpy - Record and replay HTTP interactions on your tests.
  • Object Factories
    • factory_boy - A test fixtures replacement for Python.
    • mixer - Another fixtures replacement. Supports Django, Flask, SQLAlchemy, Peewee and etc.
    • polyfactory - mock data generation library with support to classes (continuation of pydantic-factories)
  • Code Coverage
  • Fake Data
    • faker - A Python package that generates fake data.
    • mimesis - is a Python library that help you generate fake data.

Text Processing

Libraries for parsing and manipulating plain texts.

  • General
    • babel - An internationalization library for Python.
    • chardet - Python 2/3 compatible character encoding detector.
    • difflib - (Python standard library) Helpers for computing deltas.
    • ftfy - Makes Unicode text less broken and more consistent automagically.
    • Levenshtein - Fast computation of Levenshtein distance and string similarity.
    • pangu.py - Paranoid text spacing.
    • pyfiglet - An implementation of figlet written in Python.
    • pypinyin - Convert Chinese hanzi (漢字) to pinyin (拼音).
    • textdistance - Compute distance between sequences with 30+ algorithms.
    • unidecode - ASCII transliterations of Unicode text.
  • Slugify
    • awesome-slugify - A Python slugify library that can preserve unicode.
    • python-slugify - A Python slugify library that translates unicode to ASCII.
    • unicode-slugify - A slugifier that generates unicode slugs with Django as a dependency.
  • Unique identifiers
    • hashids - Implementation of hashids in Python.
    • shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.
  • Parser

URL Manipulation

Libraries for parsing URLs.

  • furl - A small Python library that makes parsing and manipulating URLs easy.
  • purl - A simple, immutable URL class with a clean API for interrogation and manipulation.

Video

Libraries for manipulating video and GIFs.

  • moviepy - A module for script-based movie editing with many formats, including animated GIFs.
  • scikit-video - Video processing routines for SciPy.
  • vidgear - Most Powerful multi-threaded Video Processing framework.

Web Asset Management

Tools for managing, compressing and minifying website assets.

  • django-compressor - Compresses linked and inline JavaScript or CSS into a single cached file.
  • django-pipeline - An asset packaging library for Django.
  • django-storages - A collection of custom storage back ends for Django.
  • fanstatic - Packages, optimizes, and serves static file dependencies as Python packages.
  • flask-assets - Helps you integrate webassets into your Flask app.
  • webassets - Bundles, optimizes, and manages unique cache-busting URLs for static resources.

Web Content Extracting

Libraries for extracting web contents.

  • html2text - Convert HTML to Markdown-formatted text.
  • lassie - Web Content Retrieval for Humans.
  • micawber - A small library for extracting rich content from URLs.
  • newspaper - News extraction, article extraction and content curation in Python.
  • python-readability - Fast Python port of arc90's readability tool.
  • requests-html - Pythonic HTML Parsing for Humans.
  • sumy - A module for automatic summarization of text documents and HTML pages.
  • toapi - Every web site provides APIs.

Web Crawling

Libraries to automate web scraping.

  • browser-use - Make websites accessible for AI agents with easy browser automation.
  • crawl4ai - An open-source, LLM-friendly web crawler that provides lightning-fast, structured data extraction specifically designed for AI agents.
  • feedparser - Universal feed parser.
  • grab - Site scraping framework.
  • mechanicalsoup - A Python library for automating interaction with websites.
  • scrapy - A fast high-level screen scraping and web crawling framework.

Web Frameworks

Traditional full stack web frameworks. Also see RESTful API.

  • Synchronous
  • Asynchronous
    • litestar - Production-ready, capable and extensible ASGI Web framework.
    • microdot - The impossibly small web framework for Python and MicroPython.
    • reflex A framework for building reactive, full-stack web applications entirely with python .
    • tornado - A web framework and asynchronous networking library.

WebSocket

Libraries for working with WebSocket.

  • autobahn-python - WebSocket & WAMP for Python on Twisted and asyncio.
  • channels - Developer-friendly asynchrony for Django.
  • websockets - A library for building WebSocket servers and clients with a focus on correctness and simplicity.

Resources

Where to discover learning resources or new Python libraries.

Newsletters

Podcasts

Contributing

Your contributions are always welcome! Please take a look at the contribution guidelines first.


If you have any question about this opinionated list, do not hesitate to contact @VintaChen on Twitter.