Batch Embedding Generation
by AaaS · open-source · Last verified 2026-03-01
Generates embeddings at scale for large document collections with batching, rate limiting, checkpointing, and error recovery. Supports multiple embedding providers (OpenAI, Cohere, local models) with automatic dimension detection and output format selection.
https://aaas.blog/script/embedding-generation-batch ↗C
C—Below Average
Adoption: BQuality: AFreshness: B+Citations: FEngagement: F
Specifications
- License
- MIT
- Pricing
- open-source
- Capabilities
- batch-embedding, rate-limiting, checkpointing, error-recovery, multi-provider-support
- Integrations
- openai, sentence-transformers, numpy, pandas
- Use Cases
- large-scale-indexing, search-index-building, migration, batch-processing
- API Available
- No
- Language
- python
- Dependencies
- openai, sentence-transformers, numpy, pandas, tqdm
- Environment
- Python 3.11+
- Est. Runtime
- 10-120 minutes depending on corpus size
- Tags
- script, automation, embeddings, batch, processing
- Added
- 2026-03-17
- Completeness
- 80%
Index Score
42Adoption
66
Quality
80
Freshness
78
Citations
0
Engagement
0