| 1. | | Show HN: Skeletoken, a Python package for editing model tokenizers (github.com/stephantul) |
| 1 point by stephantul 20 days ago |
|
| 2. | | Show HN: PyNIFE. 400-900× speedup for embedding-based retrieval pipelines (github.com/stephantul) |
| 2 points by stephantul 3 months ago |
|
| 3. | | Show HN: Skeletoken, a Package for Editing Tokenizers (github.com/stephantul) |
| 1 point by stephantul 5 months ago |
|
| 4. | | Turning any tokenizer into a greedy one (stephantul.github.io) |
| 2 points by stephantul 6 months ago | 1 comment |
|
| 5. | | Decasing Transformers for Fun (stephantul.github.io) |
| 3 points by stephantul 6 months ago | 1 comment |
|
| 6. | | Model2Vec as a Fasttext Alternative (minish.ai) |
| 5 points by stephantul 7 months ago | 1 comment |
|
| 7. | | Using overloads to handle union return types in Python (stephantul.github.io) |
| 1 point by stephantul 11 months ago | 1 comment |
|
| 8. | | Ask HN: Favourite resources for learning programming type theory? |
| 6 points by stephantul 11 months ago | 8 comments |
|
| 9. | | Evaluating ML classifiers using relative error instead of absolute accuracy (stephantul.github.io) |
| 1 point by stephantul 11 months ago |
|
| 10. | | Defeat stringly typing without making your users unhappy (stephantul.github.io) |
| 2 points by stephantul 11 months ago |
|
| 11. | | Distilling ModernBERT into a static model doesn't work (minishlab.github.io) |
| 5 points by stephantul on Jan 29, 2025 | 3 comments |
|
| 12. | | Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets (github.com/minishlab) |
| 6 points by stephantul on Jan 19, 2025 |
|
| 13. | | Train faster static embedding models with sentence transformers (huggingface.co) |
| 52 points by stephantul on Jan 15, 2025 | 1 comment |
|
| 14. | | Semhash: Fast deduplication and dataset multitool in Python (minishlab.github.io) |
| 3 points by stephantul on Jan 13, 2025 | 1 comment |
|
| 15. | | Model2Vec: Make sentence transformers 500x faster on CPU, 15x smaller (huggingface.co) |
| 5 points by stephantul on Oct 16, 2024 |
|
| 16. | | Show HN: Model2Vec: make sentence transformers 500x faster on CPU, 15x smaller (github.com/minishlab) |
| 9 points by stephantul on Sept 29, 2024 | 2 comments |
|
| 17. | | Show HN: Model2Vec: make sentence transformers 500x faster on CPU, 15x smaller (github.com/minishlab) |
| 6 points by stephantul on Sept 22, 2024 | 2 comments |
|