Rust-based Viewer: Any document, any platform, in milliseconds
3 months ago
- #rust
- #document-processing
- #sdk
- Prism is a next-generation document processing SDK built in Rust, supporting 600+ file formats.
- Key features include comprehensive format support, modern architecture, cloud-native design, security, developer-friendly APIs, and high performance.
- Supported formats include Office documents, PDFs, emails, images, archives, and CAD files.
- Built with Rust for memory safety, performance, and reliability.
- Designed for containerization, horizontal scaling, and serverless deployment.
- Includes WebAssembly sandboxing for parser isolation and security.
- Offers SDKs for 10+ programming languages.
- Components include prism-core, prism-parsers, prism-render, prism-sandbox, prism-server, and prism-cli.
- Provides CLI tools for document detection, conversion, text extraction, and metadata extraction.
- REST API endpoints include health checks, version info, format detection, conversion, text extraction, metadata extraction, and rendering.
- Can be integrated into .NET applications via prism-bindings.
- All document formats are parsed into a common intermediate representation (Document model).
- Includes a Docker setup for easy deployment.
- Performance targets include fast format detection, conversion, text extraction, and thumbnail generation.
- Security features include parser sandboxing, no code execution, memory limits, timeout protection, and no I/O access.
- Roadmap includes expanding format support, AI-powered features, enterprise features, and certifications.
- Dual-licensed: AGPL-3.0 for open-source use, with a commercial license available for proprietary applications.
- Inspired by Oracle Outside In, Apache POI, LibreOffice, and other document processing libraries.
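
Two ideas from the summary above, format detection and a common intermediate representation, are easier to picture with a little code. The sketch below is only an illustration under stated assumptions: the names `Format`, `detect_format`, `Block`, and `Document` are hypothetical and are not taken from Prism's actual API, which this summary does not show. Detection here uses well-known leading "magic" bytes for a handful of formats, and the document model is reduced to headings and paragraphs.

```rust
/// Hypothetical format tag (not Prism's real type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Format {
    Pdf,
    Png,
    Jpeg,
    /// ZIP container: used by OOXML files (.docx, .xlsx, .pptx) and plain archives.
    Zip,
}

/// Guess a format from the leading signature bytes of a file.
/// Returns `None` when no known signature matches.
pub fn detect_format(bytes: &[u8]) -> Option<Format> {
    // Well-known file signatures, checked in order.
    const SIGNATURES: &[(&[u8], Format)] = &[
        (b"%PDF-", Format::Pdf),
        (b"\x89PNG\r\n\x1a\n", Format::Png),
        (b"\xFF\xD8\xFF", Format::Jpeg),
        (b"PK\x03\x04", Format::Zip),
    ];
    SIGNATURES
        .iter()
        .find(|(magic, _)| bytes.starts_with(magic))
        .map(|&(_, format)| format)
}

/// Hypothetical building block of the intermediate representation.
#[derive(Debug, Clone, PartialEq)]
pub enum Block {
    Heading { level: u8, text: String },
    Paragraph(String),
}

/// Hypothetical common model that every parser would produce,
/// so later stages never need to know the source format.
#[derive(Debug, Clone, PartialEq, Default)]
pub struct Document {
    pub title: Option<String>,
    pub blocks: Vec<Block>,
}

impl Document {
    /// Text extraction works on the model, independent of the input format.
    pub fn plain_text(&self) -> String {
        self.blocks
            .iter()
            .map(|block| match block {
                Block::Heading { text, .. } | Block::Paragraph(text) => text.as_str(),
            })
            .collect::<Vec<_>>()
            .join("\n")
    }
}
```

The point of the design is that rendering, conversion, and text extraction only need to understand `Document`, so adding a format means adding one parser rather than touching every output path. Note that a signature check alone cannot tell a `.docx` from an ordinary ZIP archive; a real detector would have to inspect the container's contents as well.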