Jazyl Platform
Archive Platform · Award-Winning Arabic AI

Turn Any Arabic Collection Into Living, Searchable Knowledge.

Jazyl Archive is an AI-native digital repository for Arabic heritage and records. It reads manuscripts, newspapers, official documents, and media — then makes every page instantly searchable, answerable, and citable. Built ground-up for Arabic script.

Arabic OCR · HTR · ICR Semantic + RAG search Sovereign hosting
app.jazyl.qa / archive
Ask the archive
Every answer cited
Best AI Solution 2024Qatar Digital Business Award
Top 3 — R&D ExcellenceProprietary Arabic AI · US patent filed
Library-grade standardsOAI-PMH · Dublin Core · METS/MODS
ANY SOURCE · ANY SCRIPT

From Fragile Originals to Structured Intelligence.

Manuscripts, historical newspapers, official gazettes, handwritten registers, printed books, photographs, audio and video — Jazyl ingests it all, in Arabic and beyond.

IN PRODUCTION

See It Work on Real Collections.

Not mock-ups — these are live screens from deployments running on manuscripts, media, and official records.

Digitized Arabic manuscript with a phrase search returning page-cited matches
MANUSCRIPTS Search inside manuscripts Find any phrase across digitized manuscript pages — every match highlighted and cited to its page.
Documentary video player with a transcript search jumping to timestamped moments
MEDIA Searchable video & audio Speech is transcribed and indexed — jump straight to the moment a phrase was spoken.
Conversational AI answering an Arabic question with a cited source from the collection
ASK AI Ask the archive Ask in Arabic; the answer comes from inside your own collection, with the source cited.
THE AI ENGINE

It Reads What Other AI Can't.

Standard OCR breaks on Arabic script; generic models can't read centuries-old handwriting. Jazyl's Arabic-first engine was trained for exactly this — segmenting, recognizing, and structuring text other systems leave invisible.

  • OCR · HTR · ICR
    Printed, handwritten, and intelligent character recognition for Arabic.
  • Auto-diacritization (تشكيل)
    Restores Arabic diacritics to sharpen meaning and search.
  • Media → Text (STT)
    Transcribe Arabic audio and video into searchable, timestamped text.
  • Entity extraction (NER)
    Detect people, places, dates, and organizations automatically.
AI text detection overlaying a historical manuscript — detected regions in green, selected passage highlighted
CAPABILITIES

A Complete Repository, Not Just a Viewer.

Multi-format Ingestion

PDFs, images, scans, audio, video, and physical document batches — ingested at scale through bulk import.

Rich Metadata & Cataloguing

Hierarchical collections, controlled vocabularies, and descriptive metadata for precise discovery and retrieval.

Cited Q&A over Any Corpus

Conversational answers grounded in your own holdings, each with a verifiable source link and page reference.

Media Library

Audio and video sit alongside documents — transcribed, indexed, and searchable from the same unified interface.

Self-Improving Models

Every human correction feeds back into the engine — accuracy compounds on your specific collection over time.

Instant Web Viewer

A fast, RTL-native reading experience with page navigation, thumbnails, and side-by-side text and image.

INTEROPERABILITY

Built on Open Standards.

Jazyl Archive speaks the language of libraries, archives, and preservation — so your collection stays portable, harvestable, and future-proof.

OAI-PMH harvesting Dublin Core metadata METS / MODS structure DOI persistent IDs PDF/A archival format LOCKSS / CLOCKSS preservation REST API integration
High-fidelity scan of a historical Arabic newspaper page preserved in the repository
SECURITY & SOVEREIGNTY

Your Heritage Never Leaves Your Walls.

Designed for national archives, ministries, and institutions with strict data residency. Deploy on-premise or in a private cloud, with enterprise identity and full auditability.

  • Role-based access (RBAC)
    Granular permissions per collection, document, and action.
  • Enterprise SSO
    LDAP and Shibboleth integration with your existing identity stack.
  • Sovereign / private hosting
    On-premise or air-gapped — data stays within your jurisdiction.
SEE IT ON YOUR OWN COLLECTION

Let's Unlock Your Archive.

Bring us a handful of your hardest pages — a brittle manuscript, a faded newspaper, an old registry. We'll show you Jazyl read, search, and answer over them, live.

Talk to an Expert