LexPrep – Open-source toolkit for linguistic stimulus preparation
Psycholinguistics-focused: syllables, G2P, orthographic neighborhoods—spaCy is overkill.
Open research infrastructure for reproducible data preparation
Reproducible wordlist preprocessing with automatic manifest output, not text analysis.
Linguistic researchers, psycholinguists, cognitive scientists running controlled experiments with word stimuli.
spaCy · Stanza · g2p-en
If you find it interesting, a GitHub star would help visibility
Psycholinguistics-focused: syllables, G2P, orthographic neighborhoods—spaCy is overkill.
Funding marketplace meets reproducible ML execution with dry-run validation before GPU budget burns.
Quirky global word elimination game, but the 403 error blocks access entirely.
Git diff tracking without commit clutter solves the real experiment iteration pain.
AI agent actually fixes bugs in real VMs, not just prompting. Firecracker isolation + verified PRs.
Fixes WER scores by normalizing '$50' and 'fifty dollars' as equivalent.