Skip to main content
R
Ruby
Document Processing

Receipts, Invoices, and Source Documents Processed in Minutes, Not Days of Chasing

70% reduction in document processing time, 85%+ auto-classification accuracy, zero lost documents

Admin Assistant ($30-40K/yr) for document handling — replaced Deploys in 4-6 weeks

The problem

Document collection and processing is the friction point that slows every engagement. Tax returns cannot start until all source documents arrive. Bookkeeping backs up when receipts trickle in weeks after transactions posted. Year-end close stalls while your team chases clients for bank statements, loan agreements, and asset schedules. The average accounting firm spends 30% of its total engagement time on document collection and processing rather than the substantive work the client is actually paying for.

The documents arrive in chaos. Clients email receipts as phone photos, forward bank statements as PDF attachments, upload invoices to shared folders, and occasionally mail physical documents that need scanning. Each document must be identified (what is it — a 1099-NEC, a W-2, a bank statement, a vendor invoice?), classified (which client, which period, which GL account?), validated (is it complete, legible, and relevant?), and routed to the correct workflow. This triage is entirely manual at most firms.

The client-chasing compounds the processing burden. When documents are missing or incomplete, staff send follow-up requests — often multiple times for the same W-9 or K-1. A typical tax engagement requires three to five follow-up requests before all documents are collected. During tax season, document chasing consumes entire staff members' capacity, creating a bottleneck that delays every return in the queue.

Ruby automates document triage and processing: extracting data from incoming documents using OCR and intelligent parsing, classifying and routing documents to the correct client and engagement, validating completeness against PBC checklists, and generating specific follow-up requests for missing items. Your staff processes the exceptions while Ruby handles the volume.

Admin Assistant ($30-40K/yr) for document handling — replaced
That is why you need Ruby.

How it works

How Ruby works, step by step

Each step is automated. Ruby only escalates when human judgment is required.

1
Documents arrive via email forwarding, client upload portal, or Dext/Receipt Bank integration

Ruby ingests the document, applies OCR to extract text and data, identifies the document type (W-2, 1099-NEC, 1099-INT, K-1, bank statement, vendor invoice, receipt, mortgage interest statement), and extracts key metadata: date, amount, vendor/payer, EIN/TIN, tax withholdings, and reference numbers

2
Document type and metadata are extracted

Ruby matches the document to a client and engagement based on sender information, extracted EIN/TIN matching, entity name recognition, and contextual clues. Documents are classified against the engagement's PBC checklist and filed to the correct folder in the document management system

3
Document matched and classified successfully

Ruby writes the processed data into the appropriate accounting platform or stages it for the tax prep workflow, attaching the original document and linking it to the relevant transaction, period, or engagement task. For vendor invoices, Ruby codes to the correct GL account based on historical patterns

4
Document cannot be automatically matched, is illegible, or appears incomplete

Ruby flags the document for staff review with its best-guess classification and the specific issue (e.g., "1099-NEC from vendor appears to show $12,400 but amount field is partially obscured" or "Bank statement is for October but November was requested on the PBC list")

5
Engagement PBC checklist has outstanding items and the deadline is approaching

Ruby generates a personalized document request for the client listing exactly what is still needed: "We still need your 2024 1099-NEC from XYZ Consulting (showing freelance income) and your December 2024 Chase Business checking statement showing the ending balance" — not generic "please send missing documents"

6
Client responds to a document request with new uploads

Ruby processes the newly received documents against the outstanding PBC checklist, updates the engagement status, and notifies the assigned accountant when all required documents are collected and ready for substantive work to begin

What Ruby handles vs. what stays with you

Clear boundaries. Ruby works autonomously within defined limits and escalates everything else.

Ruby handles
  • Ruby ingests the document, applies OCR to extract text and data, identifies t...
  • Ruby matches the document to a client and engagement based on sender informat...
  • Ruby writes the processed data into the appropriate accounting platform or st...
  • Ruby flags the document for staff review with its best-guess classification a...
boundary
Your team handles
  • Staff review and resolve all documents that Ruby cannot confidently classify or match to a client
  • Client communication about sensitive document issues (discrepancies, potential fraud indicators) is handled by firm staff
  • Data extraction accuracy for tax-impacting documents (W-2 wages, 1099 amounts, K-1 allocations) is verified by staff before informing returns
  • Engagement timeline decisions and deadline management remain with the engagement manager
  • Client relationship context (knowing a client is going through a difficult period) informs staff-handled follow-ups, not automated requests

Integrations

Works inside your existing tools

Ruby connects to the platforms you already use. No new software to learn.

Dext Reads from
QuickBooks Writes to
Xero Writes to
Slack Writes to

Implementation

From zero to Ruby

Ruby is deployed gradually with measurable checkpoints at every stage.

Deploy time
4-6 weeks
Monitoring mode first, then gradual rollout
📋
Data required
  • Email forwarding configuration and client upload portal access
  • Dext or Receipt Bank API credentials for document ingestion
  • Client entity details for matching (EIN, TIN, entity names, addresses)
  • Engagement PBC checklists by service type (tax return, bookkeeping, year-end)
  • Accounting platform API credentials for data write-back
🚀
Pilot process

Pilot processes documents for 15-20 clients across one engagement cycle (one month of bookkeeping or one batch of tax returns). Ruby classifies and extracts in parallel with manual processing for accuracy comparison.

Full validation before production deployment

Your AI team

Works alongside Ruby

These AI employees share data and coordinate with Ruby to cover your full operation.

R

Deploy Ruby for your accounting operations

Start with a 90-minute discovery session. We will assess whether Ruby is the right fit for your workflows and show you exactly what changes.