Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
-
Updated
Jun 11, 2024 - Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Full typescript rewrite of the AO-connect package - 🐰 🕳️ 👈
A Python library for calculating a large variety of metrics from text
GoodJsCode™ guidebook. Learn the best coding practices and standards for clean, efficient and quality JavaScript code that lasts long! 💪
ESLint plugin for John Resig-style micro template, Lodash's template, Underscore's template and EJS.
✨ This is the official Punctual Letters repository ✨ The app for people with ADHD who love to read ✨
A C# port of standalone version of the readability lib
A browser extension for Google Calendar. Provides reader view, saving articles (to GitHub repository), and generation of stats from your saved articles
Offical OpenDyslexic browser extension
Generating Summaries with Controllable Readability Levels (EMNLP 2023)
The project titled "Balance Sheet" is a simple yet effective web page created using HTML and CSS, with guidance and resources from FreeCodeCamp. It displays the financial status of AcmeWidgetCorp for the years 2019, 2020, and 2021, focusing on readability, accessibility, and responsiveness.
Reduce content complexity
Extractum is a PHP library that extracts information from web pages.
Go package that cleans a HTML page for better readability.
📝 python package to calculate readability statistics of a text object - paragraphs, sentences, articles.
Suggested Coding Guidelines for Technical Artists using Python.
An HTTP proxy that parses only text, links and pictures from pages reducing internet bandwidth usage, removing ads and heavy scripts
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
Add a description, image, and links to the readability topic page so that developers can more easily learn about it.
To associate your repository with the readability topic, visit your repo's landing page and select "manage topics."