mirror of
https://github.com/vinta/awesome-python.git
synced 2026-03-23 22:03:12 +08:00
refactor: consolidate thin subcategories and split Data Analysis by domain
- Merge URL Manipulation (single entry) into HTTP Clients - Move python-slugify into General Text Processing, removing the one-entry Slugify subcategory - Consolidate YAML, TOML, and CSV subcategories into a single Data Formats group - Split Data Analysis into General and Financial Data subcategories to improve discoverability Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
23
README.md
23
README.md
@@ -32,7 +32,6 @@ An opinionated list of awesome Python frameworks, libraries, tools, software and
|
|||||||
|
|
||||||
- [HTTP Clients](#http-clients)
|
- [HTTP Clients](#http-clients)
|
||||||
- [Web Scraping](#web-scraping)
|
- [Web Scraping](#web-scraping)
|
||||||
- [URL Manipulation](#url-manipulation)
|
|
||||||
- [Email](#email)
|
- [Email](#email)
|
||||||
|
|
||||||
**Database & Storage**
|
**Database & Storage**
|
||||||
@@ -351,6 +350,7 @@ _Static site generator is a software that takes some text + templates as input a
|
|||||||
_Libraries for working with HTTP._
|
_Libraries for working with HTTP._
|
||||||
|
|
||||||
- [aiohttp](https://github.com/aio-libs/aiohttp) - Asynchronous HTTP client/server framework for asyncio and Python.
|
- [aiohttp](https://github.com/aio-libs/aiohttp) - Asynchronous HTTP client/server framework for asyncio and Python.
|
||||||
|
- [furl](https://github.com/gruns/furl) - A small Python library that makes parsing and manipulating URLs easy.
|
||||||
- [httpx](https://github.com/encode/httpx) - A next generation HTTP client for Python.
|
- [httpx](https://github.com/encode/httpx) - A next generation HTTP client for Python.
|
||||||
- [requests](https://github.com/psf/requests) - HTTP Requests for Humans.
|
- [requests](https://github.com/psf/requests) - HTTP Requests for Humans.
|
||||||
- [treq](https://github.com/twisted/treq) - Python requests like API built on top of Twisted's HTTP client.
|
- [treq](https://github.com/twisted/treq) - Python requests like API built on top of Twisted's HTTP client.
|
||||||
@@ -375,12 +375,6 @@ _Libraries to automate web scraping and extract web content._
|
|||||||
- [toapi](https://github.com/gaojiuli/toapi) - Every web site provides APIs.
|
- [toapi](https://github.com/gaojiuli/toapi) - Every web site provides APIs.
|
||||||
- [trafilatura](https://github.com/adbar/trafilatura) - A tool for gathering text and metadata from the web, with built-in content filtering.
|
- [trafilatura](https://github.com/adbar/trafilatura) - A tool for gathering text and metadata from the web, with built-in content filtering.
|
||||||
|
|
||||||
## URL Manipulation
|
|
||||||
|
|
||||||
_Libraries for parsing URLs._
|
|
||||||
|
|
||||||
- [furl](https://github.com/gruns/furl) - A small Python library that makes parsing and manipulating URLs easy.
|
|
||||||
|
|
||||||
## Email
|
## Email
|
||||||
|
|
||||||
_Libraries for sending and parsing email._
|
_Libraries for sending and parsing email._
|
||||||
@@ -482,6 +476,7 @@ _Libraries for serializing complex data types._
|
|||||||
|
|
||||||
_Libraries for data analysis._
|
_Libraries for data analysis._
|
||||||
|
|
||||||
|
- General
|
||||||
- [aws-sdk-pandas](https://github.com/aws/aws-sdk-pandas) - Pandas on AWS.
|
- [aws-sdk-pandas](https://github.com/aws/aws-sdk-pandas) - Pandas on AWS.
|
||||||
- [datasette](https://github.com/simonw/datasette) - An open source multi-tool for exploring and publishing data.
|
- [datasette](https://github.com/simonw/datasette) - An open source multi-tool for exploring and publishing data.
|
||||||
- [desbordante](https://github.com/desbordante/desbordante-core/) - An open source data profiler for complex pattern discovery.
|
- [desbordante](https://github.com/desbordante/desbordante-core/) - An open source data profiler for complex pattern discovery.
|
||||||
@@ -490,10 +485,11 @@ _Libraries for data analysis._
|
|||||||
- [optimus](https://github.com/hi-primus/optimus) - Agile Data Science Workflows made easy with PySpark.
|
- [optimus](https://github.com/hi-primus/optimus) - Agile Data Science Workflows made easy with PySpark.
|
||||||
- [pandas](https://github.com/pandas-dev/pandas) - A library providing high-performance, easy-to-use data structures and data analysis tools.
|
- [pandas](https://github.com/pandas-dev/pandas) - A library providing high-performance, easy-to-use data structures and data analysis tools.
|
||||||
- [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows.
|
- [pathway](https://github.com/pathwaycom/pathway) - Real-time data processing framework for Python with reactive dataflows.
|
||||||
|
- [polars](https://github.com/pola-rs/polars) - A fast DataFrame library implemented in Rust with a Python API.
|
||||||
|
- Financial Data
|
||||||
- [akshare](https://github.com/jindaxiang/akshare) - A financial data interface library, built for human beings!
|
- [akshare](https://github.com/jindaxiang/akshare) - A financial data interface library, built for human beings!
|
||||||
- [edgartools](https://github.com/dgunning/edgartools) - Library for downloading structured data from SEC EDGAR filings and XBRL financial statements.
|
- [edgartools](https://github.com/dgunning/edgartools) - Library for downloading structured data from SEC EDGAR filings and XBRL financial statements.
|
||||||
- [openbb](https://github.com/OpenBB-finance/OpenBB) - A financial data platform for analysts, quants and AI agents.
|
- [openbb](https://github.com/OpenBB-finance/OpenBB) - A financial data platform for analysts, quants and AI agents.
|
||||||
- [polars](https://github.com/pola-rs/polars) - A fast DataFrame library implemented in Rust with a Python API.
|
|
||||||
- [yfinance](https://github.com/ranaroussi/yfinance) - Easy Pythonic way to download market and financial data from Yahoo Finance.
|
- [yfinance](https://github.com/ranaroussi/yfinance) - Easy Pythonic way to download market and financial data from Yahoo Finance.
|
||||||
|
|
||||||
## Data Validation
|
## Data Validation
|
||||||
@@ -914,10 +910,9 @@ _Libraries for parsing and manipulating plain texts._
|
|||||||
- [pangu.py](https://github.com/vinta/pangu.py) - Paranoid text spacing.
|
- [pangu.py](https://github.com/vinta/pangu.py) - Paranoid text spacing.
|
||||||
- [pyfiglet](https://github.com/pwaller/pyfiglet) - An implementation of figlet written in Python.
|
- [pyfiglet](https://github.com/pwaller/pyfiglet) - An implementation of figlet written in Python.
|
||||||
- [pypinyin](https://github.com/mozillazg/python-pinyin) - Convert Chinese hanzi (漢字) to pinyin (拼音).
|
- [pypinyin](https://github.com/mozillazg/python-pinyin) - Convert Chinese hanzi (漢字) to pinyin (拼音).
|
||||||
|
- [python-slugify](https://github.com/un33k/python-slugify) - A Python slugify library that translates unicode to ASCII.
|
||||||
- [textdistance](https://github.com/orsinium/textdistance) - Compute distance between sequences with 30+ algorithms.
|
- [textdistance](https://github.com/orsinium/textdistance) - Compute distance between sequences with 30+ algorithms.
|
||||||
- [unidecode](https://github.com/avian2/unidecode) - ASCII transliterations of Unicode text.
|
- [unidecode](https://github.com/avian2/unidecode) - ASCII transliterations of Unicode text.
|
||||||
- Slugify
|
|
||||||
- [python-slugify](https://github.com/un33k/python-slugify) - A Python slugify library that translates unicode to ASCII.
|
|
||||||
- Unique identifiers
|
- Unique identifiers
|
||||||
- [hashids](https://github.com/davidaurelio/hashids-python) - Implementation of [hashids](http://hashids.org) in Python.
|
- [hashids](https://github.com/davidaurelio/hashids-python) - Implementation of [hashids](http://hashids.org) in Python.
|
||||||
- [shortuuid](https://github.com/skorokithakis/shortuuid) - A generator library for concise, unambiguous and URL-safe UUIDs.
|
- [shortuuid](https://github.com/skorokithakis/shortuuid) - A generator library for concise, unambiguous and URL-safe UUIDs.
|
||||||
@@ -973,12 +968,10 @@ _Libraries for parsing and manipulating specific text formats._
|
|||||||
- [markdown](https://github.com/waylan/Python-Markdown) - A Python implementation of John Gruber’s Markdown.
|
- [markdown](https://github.com/waylan/Python-Markdown) - A Python implementation of John Gruber’s Markdown.
|
||||||
- [markitdown](https://github.com/microsoft/markitdown) - Python tool for converting files and office documents to Markdown.
|
- [markitdown](https://github.com/microsoft/markitdown) - Python tool for converting files and office documents to Markdown.
|
||||||
- [mistune](https://github.com/lepture/mistune) - Fastest and full featured pure Python parsers of Markdown.
|
- [mistune](https://github.com/lepture/mistune) - Fastest and full featured pure Python parsers of Markdown.
|
||||||
- YAML
|
- Data Formats
|
||||||
- [pyyaml](https://github.com/yaml/pyyaml) - YAML implementations for Python.
|
|
||||||
- TOML
|
|
||||||
- [tomllib](https://docs.python.org/3/library/tomllib.html) - (Python standard library) Parse TOML files.
|
|
||||||
- CSV
|
|
||||||
- [csvkit](https://github.com/wireservice/csvkit) - Utilities for converting to and working with CSV.
|
- [csvkit](https://github.com/wireservice/csvkit) - Utilities for converting to and working with CSV.
|
||||||
|
- [pyyaml](https://github.com/yaml/pyyaml) - YAML implementations for Python.
|
||||||
|
- [tomllib](https://docs.python.org/3/library/tomllib.html) - (Python standard library) Parse TOML files.
|
||||||
|
|
||||||
## File Manipulation
|
## File Manipulation
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user