CaptureDocument extraction and fraud detection. 50+ document types, any format.Read the docs
01Document layerStandalone API · Any document

Kita Capture

Every document. Every number. Every fraud signal — in seconds.

Pulls credit signals from any document — typed, handwritten, photo, scan — and unifies them into one structured view ready for the decision. Not raw data for your team to clean.

Accuracy98%
Per document<30s
Document types & counting50+
Used acrossMicrofinance·SME & commercial·CDFI

Processing documents across the markets we serve

Bank statementsE-wallet recordsGovernment IDsTax documentsPayslipsAudited financialsBusiness financialsCredit reportsUtility billsMobile money recordsInsurance documentsInvoices & receiptsLoan applicationsProperty deedsVehicle registrationsBank statementsE-wallet recordsGovernment IDsTax documentsPayslipsAudited financialsBusiness financialsCredit reportsUtility billsMobile money recordsInsurance documentsInvoices & receiptsLoan applicationsProperty deedsVehicle registrationsBank statementsE-wallet recordsGovernment IDsTax documentsPayslipsAudited financialsBusiness financialsCredit reportsUtility billsMobile money recordsInsurance documentsInvoices & receiptsLoan applicationsProperty deedsVehicle registrations
02How it worksSend · Read · Verify · Return

Pixels in. Structure out.

Kita Capture reads any document with vision language models — typed, handwritten, photo, or scan — and returns clean, schema-checked structured data. Fraud signals run alongside extraction, not after.

bpi_statement_oct.pdf · page 3 of 14
Extracting
BPI

Bank of the Philippine Islands

Statement of Account

01 Oct 2025 — 31 Oct 2025

Account holderMaria Santos Reyes
Account no.8432-1184-22
CurrencyPHP
Total credits₱ 1,247,852.10
Total debits₱ 1,809,460.88
Closing balance₱ 640,552.77
DateDescriptionAmountBalance
06 OctSalary credit · Reyes Trading Co.+ 142,500.00492,103.55
07 OctGCash transfer · 091772342788,432.101,247,852.10
12 OctCASH DEPOSIT · branch-0142+ 1,809,460.881,809,460.88
15 OctSSS contribution4,250.001,803,210.88
21 OctLoan repayment · BPI Personal Loan18,492.00916,331.55
Structured output / bank_statementSchema OK
{
  "applicant": {
    "name":   "Maria Santos Reyes",
    "account_no": "8432-1184-22",
    "institution": "BPI"
  },
  "period": { "start": "2025-10-01", "end": "2025-10-31" },
  "cashflow": {
    "opening":  482183.65,
    "inflow_total":  1247852.10,
    "outflow_total": 1809460.88,
    "closing":  640552.77,
    "net_recurring_credits": 142500.00
  },
  "transactions": 73,
  "confidence": 0.98
}
Fraud signals1 signal
Font mismatch on transaction linepage 2 — 6th row · low
Statement totals reconcileopening + credits − debits = closing
Document metadata intactPDF producer matches BPI fingerprint
No duplicate of prior submitted fileperceptual hash unique vs 27 prior files
01 · Send

Upload via API, SDK, or portal. Any format. PDFs, photos, scans, screenshots.

02 · Read

A vision language model reads the document holistically. Layout, context, tables, handwriting.

03 · Verify

Tampering, metadata inconsistencies, and forgery patterns calibrated to each market.

04 · Extract

Credit signals out, not raw fields. Metrics, qualitative reads, metadata, fraud, ready for the underwriter.

03Capabilities

Credit signals out, not raw data.

Capture reads context, structure, and content the way a trained analyst would. Then it returns the credit signals your underwriters actually use, metrics, qualitative reads, metadata, fraud.

  • Credit-signal extractionMetrics, qualitative reads, metadata, and fraud, every kind of signal an underwriter needs, pulled from one document and ready to feed your policy engine.
  • Reads like an analystLayout, tables, handwriting, context. Catches the things a template OCR misses, including the things written between the lines.
  • Fraud detectionTampering, metadata inconsistencies, and forgery patterns calibrated to each market and document type.
  • Cross-document verificationReferences data points across multiple documents in the same file to catch contradictions, restating income, mismatched IDs, doctored balances.
  • Multi-format supportPDFs, photos, scans, screenshots, handwritten records, processed identically. No pre-processing required.
  • Market-calibratedTuned to local document formats across Southeast Asia, Latin America, and the United States. Not generic.
capture_response.json200 OK
{
"document_type": "bank_statement",
"institution": "BDO Unibank",
"period": "Sep–Dec 2025",
// metrics
"avg_monthly_balance": 142580.50,
"net_inflow_3mo": 485200.00,
"dscr": 2.4,
// qualitative reads
"income_pattern": "stable_recurring",
"unusual_activity": false,
// metadata + fraud
"fraud_score": 0.02,
"tampering_detected": false
}
04Coverage50+ document types

Any document. Any format. No templates.

Capture handles any document format out of the box. No templates, no pre-configuration required.

Bank statements
E-wallet records
Government IDs
Tax documents
Payslips & income
Business financials
Credit reports
Utility bills
Mobile money
Invoices & receipts
Employment records
Insurance documents
05IntegrationDeploy in minutes
i.

Python SDK

Install, authenticate, and start processing in under 10 lines. Type hints and async support included.

ii.

REST API

Language-agnostic HTTP endpoints with webhook callbacks. Output in JSON, CSV, Excel, or custom. OpenAPI spec included.

iii.

Web portal

No-code portal to upload documents, review extractions, and export results. No engineering required.

06Signal validationOn request, for lenders with a book

Want to know which signals actually predict default? We'll prove it on your book.

For lenders with enough history to test against, we replay Capture across your historical applications and match every extracted signal to your repayment outcomes, at any DPD horizon you track. You get back which signals split risk cleanly across borrower deciles, and which are noise. A capability, not a prerequisite.

i.

Replay on your historical applications

Hand us a sample of historical loans and we re-run Capture across the documents, pulling the full signal pack on every borrower file.

ii.

Match each signal to repayment outcomes

At any DPD horizon you track. Early delinquency at DPD 1 or 4, mid-book at DPD 30 or 60, charge-off at DPD 90 and beyond. Per-signal, per-window.

iii.

Ranked report, what splits risk

Each signal gets an Information Value score, monotonic decile splits, and a strength label. You see which signals belong in your scorecard and which don't move the needle.

07Try itFree credits in the portal

Underwrite more borrowers. Faster. With cleaner data.