-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
comfy-table
An easy to use library for building beautiful tables with automatic content wrapping
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
similar
A diff library for Rust
-
const_format
Compile-time string formatting
-
heck
case conversion library
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
convert_case
Convert strings into any case
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lazy-regex
lazy static regular expressions checked at compile time
-
pulldown-cmark
A pull parser for CommonMark
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
lopdf
PDF document manipulation
-
termimad
Markdown Renderer for the Terminal
-
unicode-bidi
Unicode Bidirectional Algorithm
-
unicase
A case-insensitive wrapper around strings
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
mdbook
Creates a book from markdown files
-
prettydiff
Side-by-side diff for two files
-
html2text
Render HTML as plain text
-
fuzzy-matcher
Fuzzy Matching Library
-
regress
A regular expression engine targeting EcmaScript syntax
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
const-str
compile-time string operations
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
printpdf
reading and writing PDF files
-
lindera
A morphological analysis library
-
charabia
detect the language, tokenize the text and normalize the tokens
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
roff
ROFF (man page format) generation library
-
mdxjs
Compile MDX to JavaScript in Rust
-
usearch
Smaller & Faster Single-File Vector Search Engine from Unum
-
garde
Validation library
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
diff
An LCS based slice and string diffing implementation
-
synoptic
low-level, syntax highlighting library with unicode support
-
diffy
Tools for finding and manipulating differences between files
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
text-size
Newtypes for text offsets
-
str_indices
Count and convert between indexing schemes on string slices
-
nucleo
plug and play high performance fuzzy matcher
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
os_display
Display strings in a safe platform-appropriate way
-
arrow-cast
Cast kernel and utilities for Apache Arrow
-
chardetng
A character encoding detector for legacy Web content
-
xan
The CSV magician
-
hyperlink
Very fast link checker for CI
-
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
entities
raw data needed to convert to and from HTML entities
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
ascii
ASCII-only equivalents to
char
,str
andString
-
route-recognizer
Recognizes URL patterns with support for dynamic and wildcard segments
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
pact_consumer
Pact-Rust module that provides support for writing consumer pact tests
-
whyq
jq wrapper
-
epub-builder
generating EPUB files
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
stringsext
find multi-byte-encoded strings in binary data
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
ferris-says
flavored replacement for the classic cowsay
-
text_io
really simple to use panicking input functions
-
regex-cursor
regex fork that can search discontiguous haystacks
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
unindent
Remove a column of leading whitespace from a string
-
ncount
A word count tool intended to derive useful stats from markdown
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
prema
convert markdown to html
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
pdf-extract
extract content from pdfs
-
llmvm-core
The core application for llmvm
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
mkrs
Build automation tool
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
font-types
Scalar types used in fonts
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
hck
A sharp cut(1) clone
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
sscanf
(inverse of format!()) Macro based on Regex
-
za
🛠️ Zero-to-All — scan your workspace and generate an opinionated CONTEXT.md so AI, code-reviewers and newcomers always have the full picture
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
matchers
Regex matching on character and byte streams
-
netidx
Secure, fast, pub/sub messaging
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
vaporetto
pointwise prediction based tokenizer
-
uwc
Counts things in unicode text files
-
harfruzz
A complete harfbuzz shaping algorithm port to Rust
-
diff-match-patch-rs
The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
pks
Welcome! Please see https://github.com/alexevanczuk/packs for more information!
-
http-cmd
Run a command over HTTP
-
languagetool-rust
LanguageTool API bindings in Rust
-
stop-words
Common stop words in many languages
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
markdown-tool
A CLI utility for converting Markdown into AST and vice versa
-
ra_ap_text_edit
Representation of a
TextEdit
for rust-analyzer -
zeitgrep
Find frecent results in git repositories using regular expressions
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
iepub
epub、mobi电子书读写
-
buup
Core transformation library with zero dependencies
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
cow-utils
Copy-on-write string utilities for Rust
-
xot
Full-featured XML tree library for Rust
-
epub
support the reading of epub files
-
wildcard
matching
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
regex-syntax
A regular expression parser
-
rustc-literal-escaper
code to unescape string literals
-
elfcat
ELF visualizer. Generates HTML files from ELF binaries.
-
unicode_categories
Query Unicode category membership for chars
-
dptran
run DeepL translations on command line written by Rust
-
topiary-queries
tree-sitter query files compatible with Topiary
-
skyspell
Fast and handy spell checker for the command line
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps
-
autosurgeon
working with data in automerge documents
-
difflib
Port of Python's difflib library to Rust
-
any_ascii
Unicode to ASCII transliteration
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
tkrar
Count frequency of words in a file or a directory
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
charset
Character encoding decoding for email
-
airshipper
automatic updates for the voxel RPG Veloren
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
html2md
binary to convert simple html documents into markdown
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
collclean
Clean up collaboration commands in LaTeX files
-
vesti
A preprocessor that compiles into LaTeX
-
moonwave
generating documentation from comments in Lua source code
-
allms
One Library to rule them aLLMs
-
mdbook-epub
An EPUB renderer for mdbook
-
near-facsimile
Find similar or identical text files in a directory
-
glyph_brush_layout
Text layout for ab_glyph
-
sortuniq
Find or count unique values in an input stream
-
arrow-string
String kernels for arrow arrays
-
unicode-ccc
Unicode Canonical Combining Class detection
-
precis-profiles
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
unicode-reverse
Unicode-aware in-place string reversal
-
chewing
(酷音) intelligent Zhuyin input method
-
luciferous-case-converter
A CLI tool to convert text between different cases
-
word-tally
Output a tally of the number of times unique words appear in source input
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
sile
Simon’s Improved Layout Engine
-
lsp-textdocument
A LSP text documents manager that map of text document
-
dom-content-extraction
Content extraction via text density paper
-
sigrs
Interactive grep (for streaming)
-
quixote
Quizzes and tests in Markdown
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
spin-sdk
The Spin Rust SDK makes it easy to build Spin components in Rust
-
igrepper
The interactive grepper
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
htmd
A turndown.js inspired HTML to Markdown converter
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
mdcat
cat for markdown: Show markdown documents in terminals
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
mdbook-quiz
Interactive quizzes for your mdBook
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
molybdenum
Recursive search and replace CLI application
-
markdown-it
Rust port of popular markdown-it.js library
-
repoyank
Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
sd
An intuitive find & replace CLI
-
scru64
Sortable, Clock-based, Realm-specifically Unique identifier
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
bbd
Binary Braille Dump
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
hypher
separates words into syllables
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
json2bin
A fast jsonl to RWKV binidx converter in Rust
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
rsrpp-cli
project for research paper pdf
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
reword
some utility functions for human-readable formatting of words
-
google-books1-cli
A complete library to interact with books (protocol v1)
-
termdiff
Write a diff with color codes to a string
-
cwc
A word counter utility that properly handles CJK and Unicode text
-
repgrep
An interactive command line replacer for
ripgrep
-
regex-literal
delimited regular expression literals
-
console_static_text
Logging for text that should stay in the same place in a console
-
swc-plugin-inferno
SWC plugin for InfernoJS
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
codebase-to-markdown
convert codebase to markdown format
-
minimizer
Minimize files to find minimal test case
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
transformers
like interface for interacting with local LLMs in Rust. This crate aims to provide the simplest interface to use to interact with local LLMs from Rust.
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
vew
Visualize lsof output
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
deeplx
package for unlimited DeepL translation
-
wordcut-engine
Word segmentation/breaking library
-
fm
Non-backtracking fuzzy text matcher
-
mdka
HTML to Markdown converter
-
holy-carpet
customizable blog creator
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
qpdf
Rust bindings to QPDF C++ library
-
vidyut-prakriya
A Sanskrit word generator
-
pad
padding strings at runtime
-
cmark-writer
A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format
-
ident_case
applying case rules to Rust identifiers
-
roe
Unicode case conversion
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
whitespace-sifter
Sift duplicate whitespaces away!
-
byteyarn
hyper-compact strings
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
dmos
Djot HTML renderer with advanced features
-
timerfd
interface to the Linux kernel's timerfd API
-
norad
Read and write Unified Font Object files
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
codebank
powerful code documentation generator that creates structured markdown documentation from your codebase. Supports multiple languages including Rust, Python, TypeScript, C, and Go with intelligent parsing and formatting…
-
ankinase
A parser which generates Anki cards from CommonMark
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
create_broken_files
Create broken files from other ones
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
bundle_repo
Pack a local or remote Git Repository to XML for LLM Consumption
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
bashdoc
generating documentation/help menu for user defined bash functions
-
ohos-drawing-sys
Bindings to the
native_drawing
API of OpenHarmony OS -
fasttext
binding
-
COXave
Instruments for codings
-
open-lark
Lark/Feishu Open API SDK(WIP)
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
portrait
Fills an
impl
with the associated items required by the trait -
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
mdbook-tailor
mdbook preprocessor for image-tailor
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
rzozowski
A regex crate using Brzozowski derivatives
-
diesel_full_text_search
Adds support for PostgreSQL full text search to Diesel
-
mdfried
A markdown viewer for the terminal that renders images and big headers
-
instant-segment
Fast English word segmentation
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
unidown
Convert Markdown to Unicode
-
mdmodels
generate models, code and schemas from markdown files
-
newline-converter
Newline byte converter library
-
scx_lavd
Latency-criticality Aware Virtual Deadline (LAVD) scheduler based on sched_ext, which is a Linux kernel feature which enables implementing kernel thread schedulers in BPF and dynamically loading them…
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
aki-mcolor
mark up text with color
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
sf-api
API to send commands to the Shakes & Fidget servers and parse their responses into characters
-
stfu8
Sorta Text Format in UTF-8
-
wrapr
wrap your code for ai
-
symbolic-cfi
process call frame information
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
mdlib
A beautiful markdown note-taking application
-
lsprotocol
Rust types for Language Server Protocol generated from LSP specification
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
lumin
searching and displaying local files
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
htop
HTML to PDF converter
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
datafusion-functions
Function packages for the DataFusion query engine
-
tmenu
TUI fuzzy finder
-
mcat
a powerfull extended cat command, to cat all the things you couldn't before
-
cicero-sophia
High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy
-
clipcount
Counting words from the clipboard content
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
unidoc
Unite all Markdown
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
patto
🐙 Yet another plain text format for quick note taking and task management
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
pyo3-filelike
Rust access to Python file-like objects
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
llguidance
Super-fast Structured Outputs
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
junit-report
Create JUnit compatible XML reports
-
substudy
Language-learning tools for working with parallel, bilingual subtitles and media files
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
mupdf
Safe Rust wrapper to MuPDF
-
outlines-core
Structured Generation
-
codebook_config
Configuration handling for the Codebook spell checker
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
chonkier
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
plsfix
Text cleaner upper
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
jetstream_9p
Jetstream is a RPC framework for Rust, based on the 9P protocol and QUIC
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
weasel-gen
Random ascii generation animation until target string is met
-
rapidfuzz
rapid fuzzy string matching library
-
herring-automata
Automata construction for Herring
-
quickmd
Quickly preview a markdown file
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
cronus_generator
The generators for cronus API spec
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
harper-ls
The language checker for developers
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
nu_plugin_emoji
a nushell plugin called emoji
-
mandown
Markdown to groff (man page) converter
-
whitespacesv
parser/writer for the Whitespace-Separated Value format, as defined by Stenway. See https://dev.stenway.com/WSV/. WSV offers an unambiguous alternative to CSV.
-
galm
pattern matching library
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
kelp
A convert tool for Japanese
-
rustdoc-stripper
manipulate rustdoc comments
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
unbom
Remove UTF-8 BOM from files
-
sapling-streampager
streampager is a pager for command output or large files
-
jx
An interactive JSON explorer for the command line
-
drova_plugins
Main plugins for drova
-
makepad-fonts-emoji
Makepad emoji fonts
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
asimov-module-cli
ASIMOV Module Command-Line Interface (CLI)
-
svgbob
Transform your ascii diagrams into happy little SVG
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
patchkit
parsing and manipulating patch files
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
quranize
Encoding transliterations into Quran forms
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
whatwg-infra
Tiny Rust-based implementation of the WHATWG Infra Standard
-
asciimath-unicode
Convert asciimath to unicode
-
bogrep
Full-text search for bookmarks from multiple browsers
-
langram
Natural language detection library
-
peppi
Parser for Slippi replay files
-
speedreader
A command-line speed reading tool
-
askalono-cli
detect the contents of license files
-
rascii_art
Advanced ASCII Art Generator
-
text-to-ascii-art
program to convert text to ASCII art
-
vlazba
Lojban words generator and analyzer
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
paswitch-rs
List and swap to pulse sinks by name
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
thoth-note
note-taking app written in Rust
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
oxford_join
Join string slices with Oxford Commas!
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
say-rust
command-line tool which is an alternative to echo
-
htmd-cli
The command line tool for htmd
-
llm-tui
A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli
-
anycase
a case conversion library for Rust
-
the_rock
A command line King James bible viewer
-
emojic
Emoji constants
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
codetypo-dict
Source Code Spelling Correction
-
furigana
Map furigana to a word given its reading
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
glimpse
A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.
-
mktoc
Generate Table of Contents from Markdown files
-
rusty-tesseract
wrapper for Google Tesseract
-
picodiff
Tiny GUI app to compare text easily
-
evcxr
An Evaluation Context for Rust
-
figlet-comment
quickly create banner to use as comments
-
codepage
Mapping between Windows code page numbers and encoding_rs character encodings
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
madato
command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown
-
autotex
Continuously compile TeX and LaTeX
-
data-streams
Extension traits for reading and writing data with streams
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
nu-utils
Nushell utility functions
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
scanix
search a text or pattern in files. A fast and lightwight text tool.
-
ast-grep-language
Search and Rewrite code at large scale using precise AST pattern
-
see-cat
A cute cat(1)
-
malachi
A domain specific pattern matching language made for defining bot commands
-
secular
No Diacr!
-
indent
Functions for indenting multiline strings
-
kas-text
Text layout and font management
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
iotext_rs
IoText data protocol
-
mdbook-angular
mdbook renderer to run angular code samples
-
regexml
XPath compatible regex engine
-
libharu_ng
Easily generate PDFs from your Rust app
-
sk-skimmer
Fuzzy Finder in rust!
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
charx
A replacement for char::is_ascii*
-
diffside
A CLI tool for side-by-side file diffs with themed highlighting
-
regex-charclass
Manipulate and convert regex character classes
-
ib-pinyin
一个高性能拼音匹配库
-
html-query
jq, but for HTML
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
pathmut
Command line utility for manipulating path strings
-
vidyut-lipi
A Sanskrit transliterator
-
epcmanager
EPC text tool for RFID
-
reggy
friendly, resumable regular expressions for text analytics
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
taos-query
Driver for TDengine - a timeseries database and analysis platform
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
forbidden-bands
8-bit string handling library
-
dcsv
Dyanmic csv reader,writer,editor
-
tre-regex
Rust safe bindings to the TRE regex module
-
kathoey
text feminization using open corpus linguistics data
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
ponsic-winsafe
The dependency of the
ponsic
crate -
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
ascii_help
help you quickly convert ASCII codes
-
wit_ai_rs
An unofficial Rust crate for interacting with the wit.ai API
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
tree-sitter-stack-graphs-javascript
Stack graphs definition for JavaScript using tree-sitter-javascript
-
mds
A skim-based
*.md
explore and surf note-taking tool -
inlet_manifold
A general purpose highlighting library
-
rustyink
Blazing fast static site generator
-
html-compare
compare html files
-
make87_messages
Message Types for Rust SDK for make87 platform
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
our-string
Customizable shared strings with inlining
-
okkhor
English to Bangla phonetic conversion following the 'Avro' rules
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
dprint-plugin-markdown
Markdown formatter for dprint
-
catalog-of-markdown
Generate the catalog of markdown file
-
flowquad
that helps you build UI stuff with Macroquad
-
vi
An input method library for vietnamese IME
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
polars-compute
Private compute kernels for the Polars DataFrame library
-
puppet-fmt
Automatic code formatter for puppet manifests
-
armnod
random string generator
-
zhipuai-rs
zhipuai 's api of rust
-
case
A set of letter case string helpers
-
jedi
Juggernaut Electronic Data Interchange package. This library provides a data exchange layer extended through the holy crate.
-
string_more
Extension traits for
String
and&str
types -
nib-cli
A cli for a yet another static site generator Nib
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
wikibase
access Wikibase
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
booky
analyze English text
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
addbib
An app to add linked bibliographies to markdown files
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
strloin
copy on write slices of a string
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
advancedresearch-translate
translation or reading ancient texts in their original language
-
droid-wrap
用于Rust的Android API的高级封装
-
n_gram
training n-gram language models
-
md-tui
A terminal markdown viewer
-
rustclock
a stopwatch or timer cli made in rust
-
yara-x-fmt
A code-formatting library for YARA rules
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
lancelot
binary analysis framework for x32/x64 PE files
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
notion2html
Convert Notion pages to HTML
-
arf-strings
Encoding and decoding for ARF strings
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
unicount
Alphabetic counter supporting unicode
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
enma
serving anime and manga information 📦
-
indent_write
Write adapters to add line indentation
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
encoding-next
Character encoding support for Rust
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
rush
shell
-
trxx
pack and unpack text files
-
cocomo
(Constructive Cost Model) CLI utility and library
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
jpush
集成极光App推送
-
flat_string
FlatString is fixed allocated size String that that can be created direcly on the stack
-
symbolic_expressions
A symbolic-expression parser/writer
-
clima
A minimal Markdown reader in the terminal
-
tergo-formatter
Formatter for tergo
-
text-editing
string with utilities for editing
-
unleash-types
API types for Unleash (https://github.com/Unleash/unleash) client features API response
-
epson
support for communicating with Epson brand thermal POS printers
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
badascii
Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
tbll
tbll outputs data in tabular format
-
utf16string
String types to work directly with UTF-16 encoded strings
-
hat-splitter
HAT splitter
-
ean-rs
generating and validating EAN barcodes
-
tu
CLI tool to convert a natural language date/time string to UTC
-
drova_sdk
Sdk for absolute converter of formats for dalet
-
filenamify
Convert a string to a valid filename
-
pdf
PDF reader
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
substring
method for string types
-
replaxe
A command-line tool to replace text in files with easy patterns
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
frawk
an efficient Awk-like language
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
mdsh
Markdown shell pre-processor
-
stellar-axelar-std
Standard libraries for Axelar contracts
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
screeps-body-utils
Adds calculation functionality related to creep bodies in Screeps: World
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
libarc2
Low-level interface library for ArC TWO™
-
mdbook-repl
based mdbook preprocessor that allows you to execute code in your mdbook without any server. Python, Typescript, Javascript etc.
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
aki-gsub
substitude text command, replace via regex
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
lister-cli
Lister: Navigate Markdown Lists
-
kalosm-streams
A set of streams for pretrained models in Kalosm
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
px-wsdom-ts-convert
wsdom crate
-
mkwebsite
build websites using markdown
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
text_utils_s
edit array. Example delete duplicate in array. Clear string
-
rwkv-tokenizer
A fast RWKV Tokenizer
-
erebus
A CLI message generation library
-
code_generator
A code generator (Currently only targets C)
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
deindent
A command line utility and Rust library to format overly-indented text
-
makepad-rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
ised
An interactive tool for find-and-replace across many files
-
renamer-rs
process and rename files or text
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
jom
convert JSON data to markdown by replacing placeholders with JSON values
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
keyphrases
Rapid Automatic Keyword Extraction (RAKE) implementation in Rust
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
utf16_lit
macro_rules to make utf-16 literals
-
alphabet_detector
Natural language alphabet detection library
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
wecom-agent
企业微信API的轻封装,让消息发送更加便捷。
-
somedoc
A very simple document model and markup generator
-
colored_text
adding colors and styles to terminal text
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
kalosm-language
A set of pretrained language models
-
mylibrary_
my personal library
-
morse_n_s
Test program that plays Morse code "N"s using Rust and CPAL, inspired by its use in historical aviation communications, including transmissions by Amelia Earhart
-
iregex
Intermediate representation for Regular Expressions
-
huozi
typography engine for CJK languages, especially designed for game rich-text
-
bubble-bath
Small and quick HTML sanitizer
-
domrs
Document builder and serializer
-
yore
decoding/encoding character sets according to OEM code pages
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
owned_str
Provide a stack allocated String for no-std or const environement
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
tform
format plain text into well-structured Markdown or HTML
-
fetch-catnip
fetch displaying system information and a cute cat
-
moto
motivated automation
-
readability
Port of arc90's readability project to rust
-
mdast_util_to_markdown
Markdown to AST
-
fast_symspell
Spelling correction & Fuzzy search
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
duvet
A requirements traceability tool
-
quake_text
Utils for Quake strings and characters
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
filename-refactor
Command to refactor file names
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
agentai
designed to simplify the creation of AI agents
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
re_ui
Rerun GUI theme and helpers, built around egui
-
lnk
parse and write Windows shortcut files (.lnk)
-
mdbook-merjong
A preprocessor for mdbook to add merjong support
-
mnm
Mnemonic sentences for BitTorrent info-hashes
-
sgrep
grep util for those lazy to remember many command line options
-
libchai
汉字编码优化算法
-
editdistancek
Fast algorithm for computing edit distance
-
lexical-sort
Sort Unicode strings lexically
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
notion2md
converting Notion pages to Markdown
-
quicksilverx
easy to use grep clone
-
uwurs
UwUify your strings with uwurs!
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
xfont
font query
-
array_tool
Helper methods for processing collections
-
corlib
A various ideas library
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
share-clipboard-rs
A lightweight, cross-platform utility written in Rust to seamlessly share your clipboard content across multiple devices on your local network
-
aristech-nlp-client
client library for the Aristech Natrual Language Processing API
-
pulumi_gestalt_core
Core Pulumi Gestalt implementation
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
mdopen
Preview markdown files in a browser
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
rustfmt-nightly
find and fix Rust formatting issues
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
iregex-syntax
Common syntax for regular expressions
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
tiefdownlib
manage and convert TiefDown projects
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
swamp-vm-instr-build
builds opcodes for the swamp vm
-
enc-check
inspect utf-8 and utf-16 character encodings
-
subtitler
parsing and generating subtitles
-
mdlink
Auto-convert HTTP links for your favorite services into nice Markdown links
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
linebreak
breaking a given text into lines within a specified width
-
presenterm
A terminal slideshow presentation tool
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
crankshaft-config
Configuration facilities for Crankshaft
-
syllabize-es
Syllabize Spanish text, and much more
-
gte-rs
Text embedding and re-ranking pipelines
-
syncgit
Una herramienta CLI para sincronizar repositorios Git
-
colonnade
format tabular data for display
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
cloc
Count, or compute differences of, lines of source code and comments
-
extract-strings
Extract ascii strings from files
-
raylib_interactive
An interactive library for Raylib
-
chord3
Create pdf songbooks from chopro source
-
hubble
Official Hubble plugin SDK for Rust
-
tag2upload-service-manager
Debian tag2upload service manager
-
jawk
JSON AWK
-
aho-corasick-unsafe
Fast multiple substring searching
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
asdi
Simplistic Datalog Implementation (in Rust)
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
reason-shell
Reason: A Shell for Research Papers
-
lyon_extra
Various optional utilities for the lyon crate
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
ahtml-from-markdown
Convert Markdown to ahtml HTML element trees
-
emoji
Every emoji, their metadata, and localized annotations
-
mchr
Lenient implementations of encodings. Zero allocations, zero dependencies!
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
httpwg
Test cases for RFC 9113 (HTTP/2)
-
scx_rustland
BPF component (dispatcher) that implements the low level sched-ext functionalities and a user-space counterpart (scheduler), written in Rust, that implements the actual scheduling policy…
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
furze
finite state transducers (fst) writen in rust
-
libannict
Annict API のクライアントライブラリ
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
caseless
Unicode caseless matching
-
cai
The fastest CLI tool for prompting LLMs
-
sydney
Vim-like, Command-line Gemini Client
-
nuhound
Improve error handling capability
-
wai-parser
Parser for WAI syntax
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
deliminator
Universal code documentation generator
-
rins_markdown_parser
markdown parser written on Rust
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
vidyut-kosha
A Sanskrit key-value store
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
expunge
redact and transform struct fields declaratively
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
cronus_spec
The definitions for cronus API spec
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
typeshare-cli
Command Line Tool for generating language files with typeshare
-
cli_app_capo
CLI application with Unix-like tools
-
iroh-test
Internal utilities to support testing of iroh
-
maelstrom-plot
Fork of egui_plot with added stacked line graph functionality
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
bobo_html_parser
parser of html markdown
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
ere
A compile-time alternative for POSIX extended regular expressions
-
wit-bindgen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
abbreviation_extractor
extracting abbreviations from text
-
extstd
intended as an extension of the standard library
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
linkcheck2
extracting and validating links
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
asimov-config-cli
ASIMOV Configuration Command-Line Interface (CLI)
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
docket
markdown to HTML documentation rendering
-
pullup
Convert between markup formats
-
lowcharts
draw low-resolution graphs in terminal
-
vyder_std
Standard library for vyder
-
ctreg
Compile-time regular expressions the way they were always meant to be
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
gregex
Regex solver utilizing NFA
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
named_entity_parsing
Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.
-
fontspector-profile-universal
Fontspector checks for OpenType font best practices
-
context-notation
Featherweight semantic notation for text
-
notan_draw
2D API for Notan
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
pingmoji
Useless CLI utility that parses chains of emojis and bitwise operations as ipv4 addresses and pings the result
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
minigrep_jeck
minigrep
is a grep clone that takes a query and searches for the query in the file; with added support for regex -
facebookexperimental/hgproto
A Scalable, User-Friendly Source Control System
-
scraps_libs
Scraps is a static site generator based on Markdown files written with simple Wiki-link notation, designed for personal and team knowledge management
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
stylish-ansi
stylish
helpers for writing styles as ANSI escape codes -
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
like
A SQL like style pattern matching
-
appendlist
An append-only list that preserves references to its elements
-
bump-bin
Increments version with semver specification
-
ens-normalize-rs
Ethereum Name Service (ENS) name normalization
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
portmanteau
create portmanteaux
-
wdl-ast
An abstract syntax tree for Workflow Description Language (WDL) documents
-
runi
a CLI tool to generate unicode fonts
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
meddl_translate
Translate German to Meddlfrängisch
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
tailcall-valid
validating multiple inputs, collecting all possible errors instead of failing at the first error. Useful for scenarios where comprehensive feedback is required for user inputs or configuration settings.
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
spc-core
A command-line tool for processing and analyzing data from SPC files
-
pandoc_types
Rust port of pandoc-types
-
sqdj
sqdj shortens delimited data
-
cnv
Command-line tool to convert between units of measurement
-
codetypo-vars
Source Code Spelling Correction
-
vibrato
viterbi-based accelerated tokenizer
-
widget-forge
A Widget Based Application Engine for Ascii-Forge
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
hanconv
Convert between Chinese characters variants
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
b2c2-jis-x-201
UTF-8とJIS-X-201を雑に変換処理する
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
gst-plugin-regex
GStreamer Regular Expression Plugin
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
gaoya
Locality Sensitive Hashing Data Structures
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
formatjson
Formats JSON files
-
hidden_watermark
Hidden Watermark in Rust
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
scrunch
full-text-searching compression
-
mdi
markdown include
-
mdbook-fs-summary
Summary generator for mdbook
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
glean
SDK Rust language bindings
-
ascii-hangman
customizable Hangman game with ASCII-art rewarding for children (desktop version)
-
natural
Pure rust library for natural language processing
-
pavex_miette
A custom Miette theme for Pavex CLI errors
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
trust_pdf
Verifies signed PDFs against the originals, checking for sneaky modifications
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
webspeeddial
A dial system for websites
-
monument_cli
CLI interface to Monument, a fast and flexible composition generator
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
html_to_epub
A command line converts .html file to .epub file
-
pray
A tui tool for preparing a prompt to the llms
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
pukram2html
converting Pukram-formatted text to HTML
-
fyi_ansi
Compile-time ANSI formatting macros for FYI
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
regex_generate
Use regular expressions to generate text
-
prescript
parsing and executing Prescript scripts
-
text-tokenizer
Custom text tokenizer
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
gspell
Rust bindings for gspell
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
json-predicate
JSON Predicate lib based on draft-snell-json-07
-
scx_rusty
multi-domain, BPF / user space hybrid scheduler used within sched_ext, which is a Linux kernel feature which enables implementing kernel thread schedulers in BPF and dynamically loading them…
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
cqtool
converting between CQ strings and message segment arrays
-
natord-plus-plus
Natural ordering for Rust
-
rvpacker-txt-rs-lib
providing functions for rvpacker-txt-rs
-
ncase
Enforce a case style
-
rs3a
Lib for reading and writing 3a format
-
fr_alebref_libbrefdata
BrefData library
-
enum-ts
TypeScript Enum pattern matcher codegen
-
rawcode
Implements a simple as-is encoding format
-
scanlex
lexical scanner for parsing text into tokens
-
toster
A simple-as-toast tester for C++ solutions to competitive programming exercises
-
wikipedia_prosesize
Count Wikipedia prose size
-
phonetisaurus-g2p
Phonemization in Rust using a finite state transducer (FST) trained with Phonetisaurus
-
iregex-automata
Finite automata definitions for the
iregex
crate -
parse2csv
parse log-file and output to stdout as csv file by regex
-
libanubhav
management system written in Rust
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
nu_plugin_regex
nu plugin to search text with regex
-
jumpcut
CLI for converting Fountain-formatted text files into FDX and HTML formats
-
biometrics
provide the vitals of a process in the form of counters, gauges, moments, and T-digests
-
markdown-extract
Extract sections of a markdown file
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
xml_magic
A reasonably fast XML formatter
-
firm_netter
测试,请勿使用!
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
fmri
IPS package identifier - FMRI
-
cozo
A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms
-
mdbook-github-authors
mdbook preprocessor to display Github profiles of authors of a page
-
mcp-spec
Core types for Model Context Protocol
-
greek_number
Convert numbers to Greek number strings
-
clone-spl-token-metadata-interface
Solana Program Library Token Metadata Interface
-
check_build
verify a VCF file against hg19 and hg38 references using a streaming, low-memory approach
-
yahv
hex viewer
-
thesaurus
An offline thesaurus library for Rust
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
mitex
TeX2Typst converter
-
noctisroll
Text-based TRPG dice rolling system
-
timeblok
A language for event scheduling in plain text
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
ps-str
String transcoding library
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
markdown-toc
Markdown Table of Contents generator
-
stringer
An easy way to turn an Unsafe *const c_char into a Rust String type and return a pointer
-
modularize_imports
AST Transforms for import modularizer
-
kproc
Knowledge Processing library
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
hephae
A personalized, opinionated Bevy plugin that adds support for drawing and batching arbitrary vertices and indices
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
ascii-izer
converting an image into ASCII art
-
kdump
A small utility that disassembles and reads KSM and KO files for use with KerbalOS
-
yozuk
Chatbot for Programmers
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
strs_tools
Tools to manipulate strings
-
wdict
Create dictionaries by scraping webpages or crawling local files
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
crud
CLI generator for your favorite CRUD REST API
-
slugify-rs
generate slugs from strings
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
meme_generator_memes
Meme generator built-in memes
-
py-regex
A small wrapper around the Python regex module via PyO3
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
zipcodes
Query US zipcodes without SQLite
-
swon-parol
SWON parser implementation using Parol
-
modeling
tools to analysis different languages by Ctags
-
draw_triangle
A CLI tool to draw equilateral triangles in the terminal
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
asciidork-tck
Asciidork TCK Adapter
-
hemoglobin
Bloodless
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
unicode-matching
match Unicode open/close brackets
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
diffy-imara
Tools for finding and manipulating differences between files
-
cynic-querygen
Generates code for using cynic from GraphQL query input
-
trillium-prometheus
Trillium handler for Prometheus metrics scrapes
-
nipdf-reader
iced pdf GUI reader
-
easy-regex
Make long regular expressions like pseudocodes
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
codebook_downloader
Dictionary downloading utility for the Codebook spell checker
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
simple-logging
logger for the log facade
-
rmosh
R6RS & R7RS Scheme Interpreter
-
ascii-webcam
A webcam that visualizes its output as ASCII art directly in the terminal
-
tracery
Text-expansion library
-
metatron
core library
-
markov_str
Markov Chain implementation optimized for text generation
-
streampager
pager for command output or large files
-
rsrusl
A really simple useful library ported to Rust
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
zz-data
Data structures for Zanzarah apis
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
piet-cosmic-text
A text layout engine for piet based on cosmic-text
-
utf8toipv4
Convert UTF-8 to ipv4 addresses and vice versa
-
pulldown_mdbook
A pull parser for mdBook
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
tgrep
Toy grep that honors .gitignore
-
annotate_celeste_map
rendering celeste maps, and overlaying recorded paths, lobby entrances etc
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
choco
markup language for dialogue systems
-
binatime
A binary clock in the terminal
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
stringutil
A collection of useful string utilities
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
antex
Styled text and tree in terminal
-
aho-corasick
Fast multiple substring searching
-
keyvalues-parser
A parser/renderer for vdf text
-
format-bytes
A macro to format bytestrings
-
franz
friendly, and blazingly fast alternative to Apache Kafka
-
interactive-clap
Interactive mode extension crate to Command Line Arguments Parser (https://crates.io/crates/clap)
-
yara-x-parser
A parsing library for YARA rules
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
skyspell_kak
skyspell - kakoune integration
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
orion_cfmt
Format output without Rust code segment in binary to reduce the ultimate binary size
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
realhydroper-utf16
Work with UTF-16 in Rust
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
cali
A terminal calculator with real-time evaluation, unit conversions, and natural language expressions
-
sbert
Sentence Bert (SBert)
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
grammalecte_client
Grammalecte HTTP client
-
rslint_parser
An extremely fast ECMAScript parser made for the rslint project
-
pprint
Flexible and lightweight pretty printing library for Rust
-
swc-neuron
CLI utility for interacting with SWC neuronal morphology files
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
freesia
some string operators
-
typos-cli
Source Code Spelling Correction
-
gstring
String with support for Unicode graphemes
-
gulagcleaner_rs
Ad removal tool for PDFs
-
ainconv
Converts Ainu words between different scripts (Katakana, Latin, Cyrillic)
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
man
Generate structured man pages
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
fhe
Fully Homomorphic Encryption in Rust
-
lformat
Clone of Lua
string.format
in Rust based on Cs(n)printf
-
ironsmith-parser
Transforms Smithy 2.0 IDL files into an abstract syntax tree
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
eqlog-eqlog
Datalog with equality
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
simple-ssg
Plain and simple static site generator for Djot and Markdown light markup languages
-
schemaorg-rs
Generated Rust types from Schema.org JSON-LD vocabulary
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
markdown-it-footnotes
Creates footnotes and lists of footnotes in Markdown documents
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
myanmar_util
A collection of tools for processing Myanmar text including syllable breaking and other utilities
-
r4d
Text oriented macro processor
-
codepage-strings
encode / decode strings for Windows code pages
-
rustfmt_lib
Rustfmt as a library
-
rust-regex-dsl
Regular expression DSL
-
i_shape_js
Boolean Operations for 2D Polygons. Supported operations: intersection, union, difference, XOR, and self-intersections for all polygon varieties.
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
recase
Changes the convention case of input text
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
rsnltk
Rust-based Natural Language Toolkit
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
lookbook
Component preview framework for Dioxus
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
shaperglot
Test font files for OpenType language support
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
nfa_regex
NFA regex engine for text processing
-
lindera-decompress
A morphological analysis library
-
stylin
Convert markdown to pandoc markdown with custom styles
-
ironcalc_base
Open source spreadsheet engine
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
esl01-minibytes
Shared reference-counted bytes with zero-copy slicing support
-
symspell
Spelling correction & Fuzzy search
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
stylish-style
Internal implementation details of
stylish-core
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
fast_trie
A memory efficient trie library
-
dysql-tpl
Experimental Mustache-like templating engine
-
zen-rs
generating non-interactive content like cards or files
-
katex
Rust bindings to KaTeX
-
rust_jsc_sys
Low-level bindings to JavaScriptCore
-
english
language decliner
-
latex-packer
CLI that goes though the file and subsequent \input, \include and packs all the content to the single output file
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
wtf8-rs
WTF-8 encoding
-
libretranslate
A wrapper for the LibreTranslate web API
-
basalt-core
core functionality for Basalt TUI application
-
lsp-ty
type definitons for LSP
-
ascii-img
Convert images to ASCII
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
string-overlap
A helper crate for "layering" ASCII art
-
m2h
Convert Markdown to HTML with syntax highlighting
-
record-query
doing record analysis and transformation
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
langlang_value
langlang is a parser generator based on Parsing Expression Grammars (library)
-
verba
working with Latin words
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
image-to-ascii
Converts images and gifs to ascii art
-
mdbook-plugin-utils
mdBook plugins
-
ipa-translate
translating between IPA and ASCII text
-
rusty_regex
A minimalistic regex engine in Rust using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
mdbook-typst-pdf
mdbook typst pdf backend
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
malvolio
programmatically generating HTML
-
mdzk
Plain text Zettelkasten based on mdBook
-
f-tree
the cli app to execute the commands from forester
-
loc
Count lines of code (cloc) fast
-
anslatortray
translate from English to Pig Latin!
-
merge3
merge tool for three-way merges
-
heart-strings
Quickly get random heart emojis to copy!
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
diff-man
diff utility lib
-
regex_quote_fixer
Rewrites grep regexpressions for the use in the regex crate
-
file-expert
Expert system for recognizing source code files, similar to GitHub/lingust
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
lammps-analyser
A CLI tool and language server for LAMMPS simulation input scripts
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
typeline_ext_utils
operators for typeline
-
leptos-markdown
A component which can render markdown as html element in leptos
-
afrim-translator
Manage the predication system of the afrim input method
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
spf
.spf (Simple Pixel Font) file parsing, and useful api's to go alongside
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
rk-utils
A collection of utility functions and data structures for rust
-
gliclass-rs
Inference engine for GLiClass models
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
simplers
Simplification of too complex stuff in rust
-
skyline
helping patch and modify Nintendo Switch games
-
ascii-img-cli
Command-line tool for using ascii-img
-
commit_crafter
AI powered tool for Git commit message generator
-
texcore
Create LaTeX documents using native Rust types
-
rust-texas
generate latex documents
-
rrename
" Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case
-
unified-diff
GNU unified diff format
-
split-identifier
Rust package that provides functions to split programmatic identifiers according to case conventions
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
vec-embed-store
thin wrapper around LanceDb (VectorDb) meant to provide a means to create/store/query embeddings in a LanceDb without the need to grok the lower level Arrow/ColumnarDb tech
-
textspan
Text span utility
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
rustc-demangle-capi
C API for the
rustc-demangle
crate -
bep-core
An opinionated library for building LLM powered applications
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
mrdirector
A narrative game development package for the Turbo Game Engine
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
quickner-core
A fast and simple NER tool
-
etch
Not just a text formatter, don't mark it down, etch it
-
cursive-async-view
A loading-screen wrapper for gyscos/cursive views
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
hexroll3-scroll
HEXROLL3 Scroll - the sandbox content generator
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
unicode-casing
Titlecase helper function on characters
-
random_ascii
A random [rainbow] ascii-art picker which matches your current terminal size
-
annotated-string
String with ability to annotate (format) its individual fragments
-
regex-split
split_inclusive for the regex crate
-
atlaspathwaysai
An opinionated library for building LLM powered applications
-
opstr
‘Operate on strings’ command line utility
-
slicestring
slicing Strings
-
hyper-static-server
friendly library to build static servers with hyper HTTP server
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
tagsearch
Filter plaintext files based on @keyword tags
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
uclanr
A random word picker that gives you actually useful words
-
mask-text
mask text with multiple masking options
-
text_lines
Information about lines of text in a string
-
ob
A Blog and RSS system written in Rust
-
sugarloaf
Rio rendering engine, designed to be multiplatform. It is based on WebGPU, Rust library for Desktops and WebAssembly for Web (JavaScript). This project is created and maintained for…
-
ultra-nlp
A NLP library
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
mdbook_ls
mdBook Language Server
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
committable
Keccak256-based structured commitments
-
lspt
Language Server Protocol (LSP) types made easy
-
interslavic
in rust
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
rnltk
Natural Language Toolkit for Rust
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
gds21
Integrated Circuit Layout Parser & Writer
-
epub2mdbook
convert EPUB files to MDBook format
-
ornn
Gen const for smart contract
-
scie
research about how to build simple code identify engine for different languages
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
odict
A blazingly-fast dictionary file format for human languages
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
ammonia
HTML Sanitization
-
cli_app_capo15
CLI application with Unix-like tools
-
story-dl
Story web scraping
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
font-map
Macros and utilities for parsing font files
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
rfsee-tf-idf
TF-IDF implementation for rfsee
-
nib
static site generator
-
escrit
learning languages by reading texts
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
xml_writer
writes xml, not pretty, but faaast
-
utf8streamreader
lookahead iterator on an utf8 byte stream
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
asimov-construct-cli
ASIMOV Construct Command-Line Interface (CLI)
-
select-html
Extract HTML using CSS selectors in the command-line
-
incredimo
just another font for your terminal
-
fsays
flavored replacement for the classic cowsay
-
md-bakery
Markdown Bakery CLI app
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style
${KEY}
within KiCad PCB (pcbnew) files -
bukvalno
A cli tool for converting images to ascii art
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
text-utils
Text utils for unescaping and align
-
mdbook-najan
Preprocessor for the Najan mdBook
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
rst_parser
a reStructuredText parser
-
crustword
Crusty Crosswords
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
char-ranges
Iterate chars and their start and end byte positions
-
zummi
fun lib that produces spoonerisms
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
aki-mline
match line, regex text filter like a grep of linux command
-
baste64
A base64 codec
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
neo-mime
Strongly Typed Mimes
-
dnd-character
A Dungeons and Dragons character generator
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
rusty-axml
A parser for Android AXML files
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
maudit
Framework for generating static websites
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
schmfy
Schmfication library
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
grepster
command-line tool for searching text in files
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
fluxus
lightweight stream processing engine written in Rust, designed for efficient real-time data processing and analysis
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
rust_cascade
bloom filter cascade implementation in Rust
-
lucide-rs
Provide lucide icon for rust
-
topfew
CLI to find high frequency occurrences in structured text files
-
komga
REST API Client generated from OpenAPI specification
-
mairs
was created for simply programing CLI programs, with the simplest console graphical interface
-
nesty
Generate code with with human readable indentation
-
generic_symbolic_expressions
fork of symbolic-expressions, which tweaks it to be more normal. The original crate had weird rules around putting extra double quotes.
-
pygmentize
wrapper for syntax highlighting
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
bstr
A string type that is not required to be valid UTF-8
-
just-enough-emojis
text to emoji cli
-
subject-classifier
classifying a commit by it's subject
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
markdown-it-heading-anchors
A markdown-it plugin for parsing GFM tasklists
-
knock-knock
CLI tool for obtaining and outputting domain name information in an easy-to-read format
-
aconv
Converts texts from the auto-detected encoding to UTF-8 or a specified encoding
-
esoteric-vm
An esoteric virtual machine
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
mdplayscript
An extension of Markdown for play scripts
-
path2regex
Express style path to RegExp utility
-
knowledge
The launcher app for the interacive book
-
imperative
Check for imperative mood in text
-
fip
Field Parser, roughly emulating "awk '{print $<field-number>}'"
-
pattern-generator
solving Sudoku puzzles. It takes a Sudoku puzzle input and provides the solved grid.
-
edit
Open a file in the default text editor
-
lightning-path
Route Recognizer library for lightning-fast matching
-
garde-fr
Validation library
-
st7789_rs
A driver and graphics library for st7789 displays, primarily used on a Raspberry Pi
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
runestr-pancjkv
rune
-based Pan-CJKV support -
enpsrlib
English Phrase Structure Rules library
-
supercat
A syntax highlighting alternative to cat
-
nanoid-dictionary
Popular alphabets for use with nanoid
-
flashtext2
The FlashText algorithm implemented in Rust
-
mastodon-async-entities
Types for (de)serializing entities from the Mastodon API; part of mastodon-async
-
human_regex
A regex library for humans
-
matchpick
Find and replace multi-lines using a match-case
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
bdf2surface
convert bdf font to sdl2 surface
-
comment-strip
Remove comments out of text files
-
ripmors
encoding and decoding international Morse code and several variants
-
text_unit
Newtypes for text offsets
-
nugine-rust-utils
Nugine's personal Rust utilities
-
himmelblau_kerberos_keytab
parse keytab kerberos files
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
cutters
Rule based sentence segmentation library
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
radix_trie
Generic radix trie data-structure
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
hitori
Generic compile-time regular expressions
-
twitter-text
in Rust
-
rep-grep
wgrep/write-grep CLI
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
wai-bindgen-gen-markdown
Generate Markdown API docs for a WAI interface
-
apple-notes-exporter
CLI tool for exporting Apple Notes to Markdown
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
inslice
A command-line utility for filtering text input by columns and rows
-
tabiew
A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet
-
simple_peg
A command-line peg parser implemented by Rust
-
ropey
A fast and robust text rope for Rust
-
chunk_norris
splitting large text into smaller batches for LLM input
-
zhlint
A linting tool for Chinese text content
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
lindera-cli
A morphological analysis command line interface
-
pillar
small tool to format lines into columns
-
crate-starter
starter
-
xml1
sane, non compliant xml parser without allocations
-
is_printable
Determine whether a given text-based value is printable
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
hexstring
handling hexadecimal string
-
squishyid
Shorten and obfuscate IDs
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
fj-text
creating text in fornjot
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
libefi-sys
Bindings for libefi on illumos
-
refac
Transform some text given a generic natural language prompt
-
spider_scraper
A css scraper using html5ever
-
acridotheres-3ds
Nintendo 3DS-specific file formats for Acridotheres
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
libflagup
Display a country's flag as an emoji
-
pretty
Wadler-style pretty-printing combinators in Rust
-
chinese
language nlp tools
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
popgetter-core
Core library for popgetter
-
reg_match
A match style regex tool
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
forestrie-builder
Build a trie and convert it TokenStream
-
texting
string helpers
-
committed
Nitpicking commit history since beabf39
-
repa
Peak Performance Pattern Seeker
-
ptero-cli
A text steganography CLI tool for Social Media
-
regexgrep
ripgrep tool that suports regular expressions
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
pra
Print Random ASCII
-
inboxbot
A telegram bot to save messages to a file
-
enso-lazy-reader
An efficient buffered reader
-
notegraf
Core library for building a graph-oriented notebook
-
eml-codec
Email enCOder DECoder in Rust. Support Internet Message Format and MIME (RFC 822, 5322, 2045, 2046, 2047, 2048, 2049).
-
xmlwriter
streaming XML writer
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
csv_to_table
pretty print CSV as a table
-
align
aligning text
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
pdf-sign
extract signed date from pdf file
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
altium
processing Altium file types
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
correct_word
A No brainer 'did you mean' library for Rust
-
ranting
Linguistic formatting placeholder extensions for rust
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
text_layout
Text layout algorithms
-
strange
A static website generator
-
twitter-stream-message
Types for Twitter Streaming API's messages
-
scraper
HTML parsing and querying with CSS selectors
-
tzgrep
grep tar.gz
-
codetypo-cli
Source Code Spelling Correction
-
revstr
Simply reverses strings
-
vngineer
Visual Novel game engine
-
hunspell-rs
Rust bindings to the Hunspell library
-
name-engine
computing Markov chains to generate random names based on pronunciation
-
vndb-api
Fully Functional Visual Novel Database (VNDB) HTTPS API Wrapper
-
string-simple
containing some simple string utilities that I use in my other projects
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
betacode
conversion
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
google-fonts
Download and cache TTF fonts from Google
-
uwu-rs
uwuifying library
-
detone
Decompose Vietnamese tone marks
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
ultron-syntaxes-themes
Syntaxes and themes dump for ultron
-
zoitei
alphabet conversions
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
is-vowel
Heuristically test whether a character is a vowel letter
-
fmty
Composable
core::fmt
utilities -
segtok
Sentence segmentation and word tokenization tools
-
sixbit
Small packed strings
-
hoedown
bindings for the Hoedown markdown processor
-
panda-re-sys
The official *-sys library for interfacing with PANDA (Platform for Architecture-Neutral Dynamic Analysis)
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
oui-data
looking up information from the IEEE OUI database
-
url-pattern
VERY INCOMPLETE implementation of the WhatWG URL Pattern standard https://https://urlpattern.spec.whatwg.org/. Seriously DON’T USE THIS (yet)!
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
trevordmiller
Personal CLI
-
rusticsearch
A lightweight, Elasticsearch-compatible search server (early WIP)
-
translitrs
Transliteration utility for Serbian language
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
textcat
detect text categories. It can be used to detect the language of a given text
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
basic-text-internals
Basic Text string literal implementation details
-
squidge
shortens delimited data
-
mdbook-indexing
mdbook preprocessor for index generation
-
rustrings
Strings manipulation for Rust
-
chunkr
A fast and quick chunking library for rust
-
uci-parser
Universal Chess Interface parser
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
mojibake
Encode/Decode bytes as emoji base2048
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
unic-char-range
UNIC — Unicode Character Tools — Character Range and Iteration
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
yotasm
Assembler for my 16 bit CPU
-
charclass
define and modify unicode character classes
-
markov_strings
A simplistic Markov chain text generator
-
bk-tree
A Rust BK-tree implementation
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
cur
that will hunt for your regular expression
-
minigrep_baolhq
Just getting started with Rust, enjoying it so far 😇
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
liwe
IWE core library
-
langsan
sanitizing language model input and output
-
typship
A cli for typst packages
-
korean_regex
Regex extension for Hangeul analysis
-
lazy-string-replace
A lazy version of
String::replace
, so that it can be formatted or recursively replaced without intermediate allocations -
aws-sdk-cognitoidentityprovider
AWS SDK for Amazon Cognito Identity Provider
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
japhonex
Japanese phone number checker for Rust
-
searcher_txt
A copy of grep that i made to show that im bad at rust
-
ltxcut
formats a table-like stream into a LaTeX-table
-
spigot
parser for valve's keyvalue file format (gameinfo.txt, vmt, etc.)
-
magic_string_rain
magic string
-
html2runes
An HTML to Text converter
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
delay_writer
Wraps a writer and delays its output after each newline
-
roan-engine
The core engine for the Roan project
-
awabi
A morphological analyzer using mecab dictionary
-
xhtmlchardet
Character set detection for XML and HTML
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
ucd-raw
Uninterpreted access to the unicode UCD
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
johalun/module
FreeBSD kernel module in Rust
-
jcc
Convert Juniper configurations to set-style
-
markdown-table
Creating markdown tables with Rust!
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
ascii-pixel
Convert pixel art into ascii images
-
cerpton
A 'double' Caesar Cypher
-
hello_lib
Demonstrate Generics Function
-
beans
A parser generator library based on the Earley parser
-
mdbook-keeper
An improved testing experience for mdbook
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
caser
Change text between PascalCase, camelCase, and snake_case
-
wcounter
Give the word and count the appearance
-
jp-location-relation
隣接する市区町村の一覧を取得
-
invisible_unicode
finding invisible unicode characters
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
dr
Command-line data file processing in Rust
-
libphonenumber-sys
rust ffi bindings to libphonenumber
-
pdfcr
render a codebase to a pdf
-
strip_markdown
remove markdown syntax from markdown files
-
cindex
CSV indexing library
-
rjoin
joining CSV data on command line
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
finalfrontier
Train/use word embeddings with subword units
-
ascii_tree
generates ascii trees
-
markdown-composer
composing markdown documents
-
esc
Escape characters in strings
-
msr-core
Industrial Automation Toolbox - Common core components
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
encoding8
various 8-bit encodings
-
eaverdeja-minigrep
minigrep from chapter 12 of the Rust lang book
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
twie
fast and compact prefix tries
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
mepple
English dictionary as a library
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
diagnostic
Pretty diagnostic report
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
jcalendar
Japanese Calendar for Rust
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
AsgoreCore
A small rust library to manipulate arabic text to fit in non-supporting arabic games or programes
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
psfparser
A PSF reader written in Rust
-
str_overlap
Methods for finding the overlap between two string slices
-
mudders
Generating Lexicographically-Evenly-Spaced Strings, or: Mudder.js in Rust
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
polars_arrow_rvsry99dx
Apache Arrow
-
cnpj
Brazilian CNPJ parsing, validating and formatting library
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
vape
full width aesthetics
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
mime_4
Strongly Typed Mimes
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
bge
Rust interface for BGE Small English Embedding Library
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
uwubot
discord bot for uwuifying text
-
trigram
Trigram-based string similarity for fuzzy matching
-
mdbook-footnote
mdbook preprocessor for footnotes
-
steve
Search Technical Evidence Very Easily
-
gnu-echo-rs
A rewrite of the echo GNU core utility in rust
-
testcall
companinon crate to bintest, implements test facilities
-
romulus
a stream editor like sed
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
asciifolding
ascii folding library
-
mdrss
generating RSS feeds from markdown files
-
todo_r
command line utility that keeps track of your todo comments in code
-
auk_markdown
Markdown support for Auk
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
carnation
some string operators
-
zalgo-text
A command line tool for generating zalgo text
-
kvu
The simplest command line tool to manage key-value pair lines
-
neardup
near-duplicate matching
-
besida
Language for defining branching dialogue
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
slidedeck
Create an HTML slide deck from Markdown
-
runanum
Существительные с правильными окончаниями после чисел
-
uwl
A management stream for bytes and characters
-
group-similar
Group similar values based on Jaro-Winkler distance
-
mdbook-twiki
twiki backend for mdbook
-
axum-toml
Axum extractor for TOML
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
timfmt
A small utility for formatting code as Tim likes it
-
string_morph
string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.
-
html_to_markdown
Convert HTML to Markdown
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
beediff
LCS algorithm in various applications
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
unindenter
unindent text
-
readable-readability
Really fast readability
-
uiua-doc-gen
Documentation generator for Uiua libraries
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
random-bytes
generate random bytes
-
crop
A pretty fast text rope
-
aki-xtee
copy standard input to each files and standard output
-
address_book
Інструмент командного рядка для парсингу телефонних номерів, ідентифікаторів, дат та неправильних…
-
vc_8bit
This project is a virtual computer that takes a vector of bytes and runs it as instructions. Also included is a complete assembler and compiler.
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
august
& program for converting HTML to plain text
-
glob-match
An extremely fast glob matcher
-
thousand_birds_deno
deno executable
-
summary
Extract the sentences which best summarize a document
-
rvim
A text editor in rust
-
memchr
extremely fast (uses SIMD on x86_64, aarch64 and wasm32) routines for 1, 2 or 3 byte search and single substring search
-
rst-traverse
A terminal based file manager
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
cursed_strings
Annoyed that Rust has two string types? Well it doesn't any more
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
inkline
Display colorized ascii art to the terminal
-
base16384
Encode binary file to printable utf16be, and vice versa
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
asciicast
file format used by Asciinema
-
allwords
Generate all the words over a given alphabet
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
html-escaper
HTML escaping wrapper for core::fmt::Formatter
-
codas
Compact and streamable data format that works anywhere--from web apps to robots
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
ascii_converter
converting between different ascii representations
-
ykoath-protocol
Implementaion of YKOATH Protocol
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
hex-utilities
working with hexadecimal numbers
-
ohos-ime-sys
Bindings to the
inputmethod
API of OpenHarmony -
capnp_conv
capnp write/read traits to convert from structs to readers/builders
-
case-conv
Faster case conversion crate
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
ccase
Command line interface to convert strings into any case
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
freecut
A cut optimizer gui for cutting rectangular pieces from panels
-
repub
convert markdown documents to epub
-
mdx
in Rust
-
lithe
A Slim template engine by using Pest
-
xim-ctext
compound text en/decoder
-
arabic-script
An expressive API for the characters of the Arabic script
-
trie-match
Fast match macro
-
prune
struct
-
stringedits
Edit trait and associated iterators for small edits to strings
-
char_reader
Safely read wild streams as chars or lines
-
common-words-all
Most common words sorted by ngram frequency
-
spongebobizer
Command-line utility that outputs its stdin, converted to 'sPonGeBoB cAsE', and a library to support it
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
codex
Human-friendly notation for Unicode symbols
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
rep-cli
Replace text file in bulk
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
redpatterns
a list of patterns for scanners 📟
-
textgrid
working with PRAAT .TextGrid files with parsing, riting, manipulation, and history tracking modulesfor TextGrid data
-
rblcheck
Checks DNS RBLs
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
ik-rs
chinese segment, ik-analyzer for rust
-
floem-peniko
Unofficial peniko crate for Floem
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
smoltoken
A fast library for Byte Pair Encoding (BPE) tokenization
-
mini-openai
An OpenAI API client with minimal dependencies
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
catdream
Sleeping cat dreams your text
-
rustascii
Display Rust in ASCII
-
help_crafter
help message generator without hussle
-
pager
pipe your output through an external pager
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
strip-tags
Strip HTML and PHP tags from strings
-
ascii-to-hex
A small, simple library to converting an ASCII text string into its hexadecimal equivalent
-
uiuifree-normalize
uiuifree text normalize
-
gpl-memo
Gemachain Program Library Memo
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
wordninja
port of the Word Ninja English word splitting library
-
rustextile
Textile markup language parser for Rust
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
afrim-memory
handle of sequential codes easier for an input method
-
indent_tokenizer
Generate tokens based on indentation
-
leetcode
solutions in Rust
-
texoder
A text stream which can encode/decode text in several encoding formats
-
termwrap
Wrap Unicode text with ANSI color codes
-
sarcasm
tExT creation and validation library
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
h4x_re
Hacky Regex's
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
simple_bencode
bencode encoder and decoder, that uses neither rustc-serialize or Serde. Instead, it serializes from / deserializes to a tree using a 4-branch enum.
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
md2gemtext
for converting Markdown into gemtext
-
minigrep_19283712349058
minigrep from The Rust Programming Language book
-
unicode_escape
decoding escape sequences in strings
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
freetypegl
Rust build helpers and bindings for freetype-gl
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
hearthstone
simulator written in Rust
-
pig_latin
applying Pig Latin to text
-
chromalog
A customizable logger with dynamic color coding and file logging
-
genpdf
User-friendly PDF generator written in pure Rust
-
unidecode
pure ASCII transliterations of Unicode strings
-
remake
writing maintainable regex and managing symbol soup
-
igpay-atinlay
Translate text to Pig Latin
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
kolorz
A silly little library for printing kolored text to the terminal
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
fwuffgrep
Basic implementation of a grep command written in rust
-
bsky-sdk
ATrium-based SDK for Bluesky
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
rls-vfs
Virtual File System for the RLS
-
mdbook-docslab
mdBook preprocessor for interactive code with docslab
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
ftd-rt
ftd
-
regex-intersect
Find out if two regexes have a non-empty intersection
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
asimov-account-cli
ASIMOV Account Command-Line Interface (CLI)
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
readability-rs
Port of arc90's readability project to rust
-
beautify
your terminal
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
less
pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality
-
re_view_text_document
view that shows a single text box
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
usage-cli
CLI for working with usage-based CLIs
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
rten-text
Text tokenization and other ML pre/post-processing functions
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
faker_rand
Fake data generators for lorem ipsum, names, emails, and more
-
syllable
counter for use with reading level calculations
-
ftrace
trace files and paths
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
unicode-vo
Unicode vertical orientation detection
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
split_ext
Extension traits for splitting
-
libgrep-rs
searching through text
-
rex-regextract
extracts key value pairs out of text
-
bigstr
A command-line tool to make string BIG
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
drive-image-searcher
A CLI tool to stream a drive image, and search for one or more byte patterns
-
fuzzy_mime
A Mime-Type parsing library for rust
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
deeprl
DeepL client library with all the things (blocking)
-
sourcepawn_lsp
Language Server implemention for the SourcePawn programming language
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
logseq
Handle Logseq Markdown files in Rust
-
stylish-plain
stylish
helpers for discarding styles -
baselinker
BaseLinker.com API client
-
b64
Base64 encoding/decoding support. Originally from rustc-serialize.
-
shoebill
A Wadler/Leijen style pretty-printer
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
marko
Programmtically format text with Markdown syntax
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
rust_baht_text
Convert number to Thai Baht text
-
periodic_table
that provides a list of elements in the periodic table
-
markdown-it-latex
Allows for the insertion of math in Markdown documents using LaTeX
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
gears
core implementation
-
xenon-lexer
The Xenon compiler's lexer
-
ranpha
Generate QR code of your Wi-FI network
-
transition-table
transition table utilities for keyword parser
-
imagecli
A command line image processing tool
-
yamlate
A cross-language interpreter library that dynamically evaluates YAML attributes given variable bindings
-
aoutils
A tiny utilities package to test publishing to crates.io
-
strings
String utilities, including an unbalanced Rope
-
chars_counter
The trait that implements character counting for the &str type
-
l
my personal library
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
trpl
A support crate for The Rust Programming Language book
-
hunspell-sys
Bindings to the hunspell C API
-
gestalt_ratio
Calculate the gestalt pattern matching ratio between two strings
-
markov-text
creating a small markov model for text generation
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
p4d-mdproof
Markdown to PDF converter
-
book_lib
that provides an API for managing PDFs on your mac device in one place
-
triangular-earth-calendar
An alternative timekeeping system cli tool
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
zuk
Yozuk command-line interface
-
engish
A language utility for sampling letters and building words
-
indoc
Indented document literals
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
caseformat
Power flow case data format
-
cfasttext-sys
fastText ffi binding
-
uniwhat
Display the unicode characters text
-
epubparse
Parse epub and convert to text-only Book structure
-
wkhtmlapp
Convert html to pdf or image
-
text-diff
text diffing and assertion library
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
byte-num
converting numbers to bytes, and bytes to numbers in base 10!
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
meme_generator_utils
Meme generator utils
-
writedown-html
Writedown HTML backend
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
skribo
low-level text layout
-
icu-data
International Components for Unicode (ICU) data in Rust structures
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
prettythanks
frontend to dtolnay/prettyplease library
-
ucfirst
Uppercase the first letter of a string
-
phonet
A CLI tool and library to validate phonotactic patterns for constructed languages
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
render_as_tree
visualizing tree data structures via text
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
sttx
belt for transforming speech-to-text data
-
write-html
writing HTML in Rust
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
phonics
Phonetic spelling algorithms in Rust
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
toktrie
LLM Token Trie library
-
typos-vars
Source Code Spelling Correction
-
character-set
High performance
set.contains(char)
-
badascii-mdbook
Embed badascii diagrams in your mdbook. See badascii.me for the editor.
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
framework
detector for different frameworks in one projects
-
mdast2minimad
converting markdown AST to minimad texts
-
words-count
Count the words and characters, with or without whitespaces
-
pinot
Fast, high-fidelity OpenType parser
-
flipperzero-sys
Flipper Zero
-
sauron-markdown
parsing markdown into sauron node
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
ngram-search
Ngram-based indexing of strings into a binary file
-
lunir
A universal intermediate representation oriented towards Lua
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
mrn-generator
generating valid MRNs based on ISO 6346
-
irssi-sys
Automatically generated bindings to irssi
-
find_unicode
Find Unicode characters, the easy way!
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
charmap
one-to-(none/one/many) character mapping
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
fsrenamer
refactoring invalid file/dir names
-
transcript
A transcriber for European scripts
-
saneput
Sane input reading library
-
kudubot-bindings
Rust Bindings for the kudubot framework
-
edgesearch
Serverless full-text search with Cloudflare Workers, WebAssembly, and Roaring Bitmaps
-
sayit
String replacements using regex
-
code-splitter
Split code into semantic chunks using tree-sitter
-
mykebab
convert snake_case strings to kebab-case
-
lindera-py
Python binding for Lindera
-
futf
Handling fragments of UTF-8
-
ragtime
Easy Retrieval Augmented Generation
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
goodname
assist you with cool naming of your methods and software
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
regex-filtered
Efficiently check an input against a large number of patterns
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
cyrla
two-way conversion between latin and cyrillic script
-
regex-automata
Automata construction and matching using regular expressions
-
ascii-rs
Process image into colored-ascii image
-
good-mitm-rule
Use MITM technology to provide features like rewrite, redirect
-
pdf-min
Very minimal crate for writing PDFs
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
gdnative-doc
Documentation tool for gdnative
-
single_source
Generate code files from snippets in md tutorial files
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
kincaid
A word statistics library in Rust
-
parattice
Recursive paraphrase lattice generator
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
pocky
A framework for building your own static site generator
-
markdown-includes
Include other documents, table of content, or rust-doc in Markdown using a simple template system
-
arbitrator
Format text based on a set of rules and regexes
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
typeline_ext_csv
csv parsing and serialization for typeline
-
flw
Process text via configurable tasks
-
djot
Djot parser written in pure Rust
-
html-query-extractor
HTML extractor for hq: jq, but for HTML
-
deepphonemizer
G2P model (inference only)
-
iwes
IWE LSP server
-
prettify-markdown
Format Markdown at the speed of Rust
-
crossandra
A straightforward tokenization library for seamless text processing
-
genex
Text-expansion library
-
corollary
Cross-compiles Haskell into Rust
-
lindera-filter
Character and token filters for Lindera
-
uiuifree-text-data
csv and excel convert
-
character_frequency
counting character frequencies in a string concurrently
-
mdbook-unlink
A mdBook backend that validates local links
-
rulet
figlet implementation
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
fcnt
cmd-line tool for counting the number of files in given directories
-
libxdiff
Rust bindings for the libxdiff C library
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
morse-nostd
A nostd version of the morse crate
-
webreg
A CLI tool for testing regexes against web pages
-
qpprint
console printing/formatting
-
dequote
Remove nested quotes around text
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
macro_colors
colorful printing macros
-
jput
puts and putc on unicode-width align for Rust
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
doccy
brace based markup language
-
ontodev_valve
A lightweight validation engine written in rust
-
grace-cli
CLI tool for processing files and strings
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
ligotab
Format delimited data with lightweight markup
-
termbook-cli
termbook
is a command-line tool to buildmdbook
’s while executingbash
codeblocks and collecting their output to become part of themdbook
-
text_distance
A collection of approximate string matching algorithms
-
aki-mcycle
mark up text with cycling color
-
vidyut-chandas
A Sanskrit metrical classifier
-
txt_otp
A text based one time pad library
-
rustfits
A light-weight FITS file reader in Rust
-
unic-ucd-segment
UNIC — Unicode Character Database — Segmentation Properties
-
trans-case
Transform case
-
forming
lightweight architecture as code language. 架构描述语言
-
tgo
Heterogeneous data type transtion, it's safe, lightweight and fast
-
uecho
The unicode of the echo command
-
yitizi
異體字查詢 Get variant Chinese characters
-
texc-latex
Contains LaTeX templates for TeXCreate
-
equt-md-ext
Extend event iterator
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
mdtranslation
prepare multi-lingual Markdown documents
-
docfmt
A document formatter using Handlebars templates
-
shapdf
Create Shapes into PDF
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
vroom
Vim macros from the shell
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
rust-cheatsheet
a quick cheatsheet for rust
-
timeharsh
implements the timehash algorithm, an algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
validations
arbitrary types
-
subscript-compiler
A modern LaTeX rendition
-
csvsc
Build processing chains for CSV files
-
combos
Print all permutations of a word list
-
spellabet
Convert characters into spelling alphabet code words
-
kth-lines
Command line tool for filtering stdin lines that just work
-
paxcii
Transform images and videos to ascii
-
rustex
auto-generated LaTeX files in Rust
-
intname
Full English name for any integer of any primitive integer type
-
cronus_parser
The DSL parser for cronus API spec
-
mul
Bengali stemmer
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
pdf-create
low-level, strongly-typed PDF creation library
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
cozo-ce
A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms
-
synterm
making beautiful REPLs and Shells with fish like as you type syntax highlighting
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
pdfrust
PDF parser
-
encoding
Character encoding support for Rust
-
search-in-terminal
A terminal-based search tool
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
rusk
a Specification Language
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
xee-xpath
XPath 3.1 library API
-
wit-bindgen-gen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
stone-mason
simplify using the Amazon Bedrock Rust SDK aws-sdk-bedrockruntime
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
indentation
Formatter
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
marker
finding issues in CommonMark documents
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
yozuk-core-skillset
Set of default Yozuk skills
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
lindera-dictionary
A morphological analysis library
-
ucd-util
A small utility library for working with the Unicode character database
-
bidi
Unicode Bidirectional Algorithm (UBA)
-
tradukisto
Kinda useful natural language translation library and utility
-
sparklet
small flashcards library
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
write16
A UTF-16 analog of the Write trait
-
itext
Safe rust bindings to the iText 7 PDF generation library written in Java
-
protobuf
Protocol Buffers - Google's data interchange format
-
wn-parser
parser for WordNet database files
-
ftd
ftd: FifthTry Document Format
-
draconis
Small terminal welcome program written in rust
-
markovish
Markov chain implementation for text generation
-
stylish-stringlike
API for string-like objects that have styles applied
-
maud-pulldown-cmark
An adapter between maud and pulldown-cmark
-
webvtt-parser
WebVTT parser for Rust
-
string-cases
String case conversion utilities
-
yuto51942-servant
cli
-
mdbook-morsels
Morsels plugin for Mdbook
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
blitztext
fast keyword extraction and replacement in strings
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
minigrep_iaziz786
grep
-
passgenr
generating cryptographically-secure passwords in Rust
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
text-to-json
Convert text to json in rust
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
md-localizer
Localize markdown with remote links
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires
mmdc
-
findtext_doc
Search text in Document
-
ezemoji
Catigoryized Emoji's
-
moscato
Outline scaler for OpenType glyphs
-
tdk_sozluk
TDK Sözlük API verilerini çeken bir Rust kütüphanesi
-
lodestone
A website wrapper for FFXIV's lodestone
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
aaa
CLI tool for work with 3a files
-
asimov-repository-cli
ASIMOV Repository Command-Line Interface (CLI)
-
pcre2
High level wrapper library for PCRE2
-
gregex-logic
Logic for the gregex crate
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
dekor
styling and character repository in Rust
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
swc_plugin_import
babel-plugin-import rewritten in Rust
-
deface
Lightweight markup to HTML converter
-
parser-web
Web API for extracting text from various file formats
-
wxf-converter
Transform yaml, json, pkl files to wolfram
-
jp_utils
Utils for working with Japanese text
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
ascii-read
BufRead
-like methods for reading into anAsciiString
-
beemovie-cli
Bee Movie CLI Application
-
arg_input
ARGF-style input handling for Rust
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
md-designer
A CLI tool for creating design docs in Markdown
-
mistletoe
Polyglot Kubernetes Package Manager
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
askama-filters
Extra template filters for Askama
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
text-to-png
way to render text to a png image with basic options
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
text-tables
A terminal/text table prettifier with no dependencies
-
chardet
rust version of chardet
-
umlauts
text transformation of german umlauts
-
retest
Command-line regular expression tester
-
xlsxwriter
Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more
-
twitch2csv
stream the chats of Twitch channels as a CSV
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
beemovie
Bee Movie crate
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
ryaspeller
lib for searching typos in text, files and websites
-
scripter
A screenplay compiler
-
pdf_composer_definitions
PDF Composer definitions crate
-
moenarchbook
Creates a book from markdown files
-
zbuf
“Zero-copy” string and bytes buffers
-
linetime
command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
dprint-plugin-sql
SQL formatter for dprint via sqlformat-rs
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
aki-stats
output the statistics of text, like a wc of linux command
-
findtext_textfile
Search text in text file
-
wcount
CLI word counting tool
-
struckdown
A structured markdown / commonmark library for Rust
-
soundchange
implementing sound change algorithms in Rust
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
gret
command line tool to search for patterns and show matches in a tree structure
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
text-parsing
Hierarchical text processing preserving char position info
-
unicode-box-drawing
Unicode box-drawing characters
-
difference
text diffing and assertion library
-
asimov-patterns
ASIMOV Software Development Kit (SDK) for Rust
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
mmseg
Chinese word segmenation algorithm MMSEG in Rust
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
linkcheck
extracting and validating links
-
presciidoc
Preprocessing AsciiDoc for other tools
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
static_table
creates pretty tables at compiler time
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
adobe-cmap-parser
parse Adobe CMap files
-
latin1str
Windows-1252 string types
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
utf8-command
UTF-8 encoded
std::process::Command
output -
psa
PSA(Project structure analysis) is a analyzer for analysis project struct
-
rusttyper
Basic text layout, using rusttype
-
ogrep
searching in indentation-structured texts
-
wz
Count words, fast
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
kanpyo
Japanese Morphological Analyzer
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
ron_to_table
pretty print RON as a table
-
rdg
Random data generator for the command line
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
carlotk
The main library for Carlo, a simple interpreted programming language
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
hvm-core
massively parallel Interaction Combinator evaluator
-
pact_matching
Pact-Rust support library that implements request and response matching logic
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
uniaxe
replace Unicode letters with Ascii equivalents
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
cw
Count Words, a fast wc clone
-
korrektor
work with Uzbek language text processing
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
rsrpp
project for research paper pdf
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
encoding_c
C API for encoding_rs
-
stringsort
Pathological sorting of string characters
-
dvi2html
converter
-
latex-to-html
Latex to html converter
-
unicount-lib
Alphabetic counter supporting unicode
-
user_doc-tests
Tests for user_doc
-
grammateus
facilitate working with Ancient Greek words
-
backslash
parsing escape characters
-
assert-text
the testing macro tools
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
genere
randomization of text respecting grammatical gender of sentences
-
syllarust
quickly counting syllables
-
eudex
A blazingly fast phonetic reduction/hashing algorithm
-
simple-xml-builder
XML builder/writer
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
utf16-ext
Extensions for reading and writing utf-16
-
typos-dict
Source Code Spelling Correction
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
demoji
Remove all emojis from a string
-
unicode-canvas
creating text base drawing
-
strizer
minimal and fast library for text tokenization
-
e_book_sync_library
Synchonize e-book with your local e-library
-
shallow
long text
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
vtext
NLP with Rust
-
const_format_proc_macros
detail of the
const_format
crate -
text-sanitizer
convert text to plain ASCII text
-
mdbook-translation
prepare multi-lingual mdBook books
-
gesha-core
Core functionality for Gesha project
-
anagrambot
find anagrams of words
-
typeline
Efficient, Type-Safe Pipeline Processor
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
shelldon
your new Rust-powered buddy with GPT features!
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
stylish-html
stylish
helpers for writing styles as HTML elements -
text_manipulation_rs
generating random placeholder text in different languages
-
bpmf_py
A Bopomofo and Pinyin library
-
endf_parser
parsing ENDF-6 format nuclear data
-
github-slugger
A slugger for GitHub headings
-
minigreper
Small grep style cli from the book
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
ced
Dead easy csv editor
-
chisel-lexers
Chisel backend lexers/scanners
-
mdtable-cli
that makes creating tables in markdown much easier!
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
base100
Encode your data into emoji
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
word_filter
A Word Filter for filtering text
-
zw
encoding and decoding text using zero-width characters
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
ngrams
Generate n-grams from sequences
-
csvre
replacing data in CSV columns with regular expressions
-
dhoni
converting Bengali text into their phonetic counterpart
-
dd
a clone of the unix coreutil dd
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
once-cell-regex
just gives you the
regex
macro from theonce_cell
docs! -
rustinsight
The launcher app for the interacive book
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
txt_to_md
Command converting from a txt file to a markdown file
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
encoji
Emoji based encoding and decoding. 🔥🔥🔥🚀
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
oxcomm
using Google Translate on the fly
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
charjpoet
Charj Poet is a API for write to
.cj
language -
tiny-gradient
Make your string colored in gradient
-
seven_seg
Seven-segment digital display for terminal
-
scrambler
command line tool to scramble letters
-
markdown-heading-id
Filter for
pulldown-cmark
which converts headings with custom ID -
corpus-preproc
A preprocessor for text and HTML corpora
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
bocu1
BOCU-1 compressed unicode encoding
-
pdfutil
PDF document manipulation
-
braille_pics
producing text-art pictures using Braille characters
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
br-pdf
PDF Invoice Processing
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
yozuk-helper-english
English NLP utilities for Yozuk
-
cute_strings
colorize strings in the terminal
-
bookrafter
This repository contains code related to bookrafter rendering
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
wordnet
Read a wordnet dictionary in Rust
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
rpdf
PDF command-line utils written in Rust
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
czv
performing CSV-related operations for data engineering and analysis
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
borderrs
Add stylish borders around your text and datastructures
-
naromat
Convert text to narou novel format
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
bos_books_codes
that handles 3-character Bible Books Codes
-
untex
Understand and manipulate TeX files with ease
-
lindera-compress
A morphological analysis library
-
md-include
include any file in markdown files
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
latexify
Shared definition for turn a rust object into latex code
-
encoding-next-types
Traits and types for the encoding package
-
yhy-email-encoding
Low level email encoding RFCs implementations
-
convert_encoding
Convert encoding of text files in batch
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
codegenrs
Moving code-gen our of build.rs
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
top-english-words
retrieve top words from the English language
-
crypto-invert
Unicode Upside-Down Mapping
-
test-catalog
Collect and export test cases as a catalog
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
ankiding
Creating Anki-Flashcards within Markdown!
-
gqlog
👾 filter your json logs with graphql 👾
-
basic_lexer
Basic lexical analyzer for parsing and compiling
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
skyspell_core
skyspell core library
-
admerge
Merge multiply sources into one, with advanced options
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
toml_to_table
pretty print TOML as a table
-
hina
:]
-
df_cp437
Decoder for CP437 to UTF-8
-
aklat
create books from markdown files (like Gitbook)
-
ascii_code_finder
find ascii code of a character or get a character by its ascii code
-
veryfi
Module for communicating with the Veryfi OCR API
-
lines
Utililities for iterating readers efficiently line-by-line
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
spongedown
Converts markdown to html with svgbob support
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
textocx
Tex code to Office MathML
-
blockcounter
Counts the blocks in a stream
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
mime-rs
A text processing framework, inspired by Emacs lisp and keyboard macros
-
wz-utf16
UTF-16 counters for wz
-
iasthk
Harvard-Kyoto to IAST conversion
-
txttyp
Formatted string typewriter
-
fast_aug
Fast data augmentation for text
-
morc
Dead simple, minimal markdown generator library written in Rust
-
markdown2unicode
Converter from markdown notation to unicode characters
-
xsv
A high performance CSV command line toolkit
-
alpino-tokenize
Wrapper around the Alpino tokenizer for Dutch
-
emojicons
Parse :emoji: notation to unicode representation
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
pta-generator
Test data generator for PTA applications
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
stamd
Webservice for working with stand-off annotations on text (STAM)
-
opencc
binding for Rust
-
buf-trait
abstract over [u8], str, and friends
-
cattocol
Combine two text into one text as columns
-
hline
a grep-like tool that highlights lines in files
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
vaporetto_rules
Rule-base filters for Vaporetto
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
lithe-cli
A cli of lithe
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
sprinkles
Randomly colors input text and outputs it to the terminal
-
maybe_utf8
Byte container optionally encoded as UTF-8
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
quick_io
facilitate input and output within programs, with a set of macros
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
tiniestsegmenter
Compact Japanese segmenter
-
llmvm-core-lib
llmvm core application
-
rustrawi
Rust port of the original PHP Sastrawi
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
contractions
expand contractions in English
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
ascii_utils
handle ASCII characters
-
glyphana
Quickly find, inspect & collect unicode glyps
-
norm-email
strip email provider defined behaviour from email addresses
-
aki-txpr-macro
the more easy to use libaki-*
-
tfidf-summarizer
Basic tf-idf compute for documents
-
parser-cli
Command-line interface for extracting text from various file formats
-
fbihtax
CLI tool to help manage tax payments in FBiH (Bosnia and Herzegovina Federation)
-
worcher
full-text search for static websites
-
pulldown-cmark-fork
A pull parser for CommonMark
-
cjieba-sys
unsafe ffi to cppjieba
-
chisel-parsers
Chisel parser front ends
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
varcon-core
Varcon-relevant data structures
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
mdbook-chapter-number
A mdBook preprocessor that adds chapter numbers to the each page header
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
yozuk-model
NLP model generator for Yozuk
-
hsk
Return HSK Level for Simplified Chinese Characters
-
quartz_commands
Generates a parser at compile-time for handling commands similar in structure to those of Minecraft
-
anagram
A collection of anagram utility functions
-
modit
Modal editor parser
-
suffix
arrays
-
typeline_ext_sqlite
sqlite integration for typeline
-
quill_delta_pdf
Convert Quill Delta to PDF
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
rustyword
An anagram finder
-
mupdf-sys
Rust FFI binding to MuPDF
-
cmdcjones_minigrep
A minimal grep clone from the Rust Book
-
kilo
small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄
-
case_convert
Converts the first letter of a Rust String to uppercase
-
asciify
converting images to a readable format on the command line
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
docstring
manipulating and parsing documentation strings
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
textos
Texts, strings, formatting, unicode…
-
mathml-latex
Convert between MathML and LaTeX
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
word_iter
Iterator over all words in a string
-
aki-json-pick
The json pick out command
-
re2
Wrapper for the re2 C++ regex library
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
pdf_encoding
Font related encodings
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
file-search
File indexing and search
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
with-str-bytes
Safely manipulate the bytes of a UTF-8 string
-
perlin
A lazy, zero-allocation and data-agnostic Information Retrieval library
-
ttf_word_wrap
Wraps text based on character width
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
findtext_sheet
Search text in SpreadSheet
-
uwu_cli
uwuifying the terminal
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
rtlicious
A nom-based parser for Yosys RTLIL files
-
asciir
Print ASCII table/values
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
tadm
A collection of algorithms and data structures wrote out while reading The Algorithm Design Manual book
-
rough
A very simple and opinionated static site generator
-
text_to_emoji
Convert text to emoji
-
rusty_code_code_for_book
my book_rusty code
-
catmark
Console printer for CommonMark
-
minigrep_flict
Simplest text-in-file search engine from rust book
-
lorgn_lang
a general purpose scripting language optimized for graphical programming
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
spyglass
Search engine for documents, inspired by bioinformatics
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
rigrep
grep from Rust Book
-
masker
Mask patterns in data
-
mdbook-trace
A traceable document preprocessor for mdbook
-
md-dir-builder
Webserver for serving all markdown files in a directory
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
twemoji-rs
A word-cloud image generation crate
-
token-counter
wc
for tokens: count tokens in files with HF Tokenizers -
txtframe
Creates a frame for text
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
overlap
shows overlap text in files
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
poetry-book
Create a poetry book in latex, starting from plain text
-
merge_pdf
Merge PDF files in a directory
-
japanese-ruby-filter
Japanese ruby notation parser
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
encoding_c_mem
C API for encoding_rs::mem
-
trexter
Text progression tracking library
-
decline-word
Choose word form based on given number
-
llmvm-outsource-lib
outsource backend for llmvm
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
transliterate1234
UTF-8 to ASCII transliteration
-
html_to_pdf_lib
converting HTML to PDF
-
bgrep
grep tailored to handle binary patterns and files
-
compiler-tools
A proc-macro for deriving powerful and fast tokenizers with compile-time regex
-
termbook
behind the
termbook-cli
-
unidok
A powerful, readable, easy-to-learn markup language
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
books_description_parser
A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing
-
chapter-8-exercises
Exercises from the 8th chapter of the book
-
static_format
Format strings with no runtime overhead
-
ascii_set
Fast membership of ASCII character classes
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
pdf_form
programatically filling out pdf forms
-
pdf_composer_base
PDF Composer base functionality crate
-
minigrep_desonglll
grep implementation from The Rust Programing Book
-
tashkil
A lightweight library for removing Arabic diacritics
-
h_hangul
Korean Characters
-
json-peek
Amature JSON parser library designed for my specific need
-
swappy
An anagram generator
-
pix-brcode
A ready to use compliant PIX specification, featuring fast de/serialization
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
ddvm
Document to Document Virtual Machine
-
slicer
that slices string slices into smaller string slices
-
openlibrary-rs
A wrapper around openlibrary's Web API
-
lindera-wasm
A morphological analysis library for WebAssembly
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
bqrs
apply boolean query to text
-
indexrs
inefficient multi-language search index
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
llmvm-chat
An llmvm frontend that acts as a CLI chat interface
-
grep-table-converter
A cli utility to convert grep result to table (csv, markdown, textile)
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
simplearrayhash
-
tex
The νTeX typesetting engine
-
mnumonic
A tiny library to convert opaque binary data to and from a human-memorable phrase
-
jieba-macros
jieba-rs proc-macro
-
code-span
Add additional infomation to code character
-
lindera-assets
A helper crate to fetch assets and build dictionary for lindera
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern
-
align_text
Aligns lines in a block of text within a number of columns
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
emoji_converter
Converts text to emojis
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
mdbook-post
A CLI for add post to mdbook
-
codebook
A code-aware spell checker library (dependency for codebook-lsp)
-
minigrep_lswarss
A very small part of Unix/Linux tool
grep
made with Rust for learning purpose while reading and studying the Rust Book -
unic-ucd-common
UNIC — Unicode Character Database — Common Properties
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
minigrep_bakedspacetime
Minimal Rust implementation of grep based on The Book
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
font-map-core
Core font-parsing capabilities for font-map
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
unic-ucd-name
UNIC — Unicode Character Database — Name
-
conveyance
A stop-gap CLI for conveyancing
-
boxy
Declarative builder for Unicode box-drawing characters
-
typeline_ext_http
http(s) tooling for typeline
-
panduck-latex
Use panduck to generate XeLaTeX
-
flesh-reading-ease
Calculate Flesh Reading Ease for a given text and language
-
raekna-parser
code needed to parse string slices into Expressions that can later be evaluated
-
pdf_forms
programatically filling out pdf forms
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
txt_processor
A little library for text processing
-
tectonic_pdf_io
Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate
-
unic-common
UNIC — Common Utilities
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
genpdfi
User-friendly PDF generator written in pure Rust
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
writedown
format parser
-
bookbinder
Produce books in various formats from markdown, with some understanding of structural semantics and rendering options
-
lipsum-cn
Pseudo-Chinese lorem ipsum generator
-
rust-jieba
Rust binding to cppjieba
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
wkhtmltox-sys
FFI bindings to wkhtmltox
-
shift_or_euc
Detects among the Japanese legacy encodings
-
bisect
search stdin based on a bitstring pattern
-
mini-grep
A test crate with mini grep as in The Book
-
pdf-annotations-converter
Converts annotations found in PDF files to different formats
-
lix-score
Calculate LIX score for a given text and language
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
lindera-analyzer
A morphological analysis library
-
textract
extract text from various types of files
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
lingua-arabic-language-model
The Arabic language model for Lingua, an accurate natural language detection library
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
string-sections
Build tool for Leptos
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
typeline_ext_python
python integration for typeline
-
minigrep_crate
grep like console application
-
grep-searcher
Fast line oriented regex searching as a library
-
unic-ucd-case
UNIC — Unicode Character Database — Case Properties
-
ssml-parser
parsing speech sythnesis markup language
-
md2pdf
A small utility to convert markdown files to pdf exploiting tectonic
-
lindera-tokenizer
A morphological analysis library
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
hello_rust_lang_book_chpater_20
rust lang book chapter 20
-
lindera-core
A morphological analysis library
-
gulpeaseindex
Calculate Gulpease index for a given text and language
-
ruby-parser
A parser for the Ruby language
-
findtext_pdf
Search text in PDF
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
grep-regex
Use Rust's regex library with the 'grep' crate
-
json_to_table
pretty print JSON as a table
-
lingua-japanese-language-model
The Japanese language model for Lingua, an accurate natural language detection library
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
chardetng_c
C bindings for chardetng
-
grep-matcher
A trait for regular expressions, with a focus on line oriented search
-
jpreprocess-core
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
ssam
short for split sampler, splits one or more text-based input files into multiple sets using random sampling. This is useful for splitting data into a training, test and development sets, or whatever sets you desire.
-
fountain
Parser and formatter for the Fountain screenplay markup language
-
lingua-czech-language-model
The Czech language model for Lingua, an accurate natural language detection library
-
unic-ucd-name_aliases
UNIC — Unicode Character Database — Name Aliases
-
grep
Fast line oriented regex searching as a library
-
ucd-trie
A trie for storing Unicode codepoint sets and maps
-
lingua-finnish-language-model
The Finnish language model for Lingua, an accurate natural language detection library
-
lingua-polish-language-model
The Polish language model for Lingua, an accurate natural language detection library
-
unic-ucd-hangul
UNIC — Unicode Character Database — Hangul Syllable Composition & Decomposition
-
lingua-swedish-language-model
The Swedish language model for Lingua, an accurate natural language detection library
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
lingua-hebrew-language-model
The Hebrew language model for Lingua, an accurate natural language detection library
-
lingua-russian-language-model
The Russian language model for Lingua, an accurate natural language detection library
-
lingua-persian-language-model
The Persian language model for Lingua, an accurate natural language detection library
-
unic-char
UNIC — Unicode Character Tools