File Parsing | Data Layer | Rayven Platform

AI-powered document extraction

Pass PDFs, Word documents + text files to an LLM connector node (OpenAI, Claude, Gemini + others) for structured data extraction. The AI reads the document and returns configured fields as structured JSON - no manual template mapping required.

CSV + structured file parsing

Ingest CSV, XML + JSON files from FTP, SFTP or S3 and parse field values into workflow payloads automatically. Configure column mapping, data type conversion + validation rules to ensure structured output regardless of input format variation.
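As a rough illustration of column mapping, type conversion + validation, the sketch below uses Python's standard `csv` module rather than any Rayven node. The column names, the `COLUMN_MAP` table and the non-negative amount rule are all hypothetical, chosen only to show the idea.

```python
import csv
import io
from datetime import date

# Hypothetical mapping: source column name -> (payload field, converter).
COLUMN_MAP = {
    "Order ID": ("order_id", int),
    "Amount": ("amount", float),
    "Order Date": ("order_date", date.fromisoformat),
}


def parse_csv(text):
    """Return (valid_rows, rejected_rows) for CSV text held in memory."""
    valid, rejected = [], []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(csv.DictReader(io.StringIO(text)), start=2):
        try:
            record = {
                field: convert(row[source].strip())
                for source, (field, convert) in COLUMN_MAP.items()
            }
            # Hypothetical validation rule for this sketch.
            if record["amount"] < 0:
                raise ValueError("amount must be non-negative")
            valid.append(record)
        except (KeyError, ValueError, AttributeError) as exc:
            # Missing columns, short rows and bad values all land here.
            rejected.append({"line": line_no, "error": str(exc)})
    return valid, rejected


SAMPLE = (
    "Order ID,Amount,Order Date\n"
    "1001,19.99,2024-03-01\n"
    "1002,abc,2024-03-02\n"
)
valid_rows, rejected_rows = parse_csv(SAMPLE)
```

Keeping rejected rows alongside their line numbers, instead of raising on the first failure, means one bad record does not block the rest of the file.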

Email + attachment processing

Process emails and extract data from attached files automatically on receipt. Structured content from email bodies or attachments flows into workflows - useful for invoice processing, report ingestion + document-triggered automation.

Regex + validation rules

Apply regex patterns, field validation rules + mapping logic to incoming file data. Validate field formats on ingestion + flag or reject records that fail quality checks before parsed data reaches storage or downstream processing.
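A minimal sketch of regex-based field validation, written in plain Python for illustration. The field names and patterns in `RULES` are assumptions, not Rayven defaults; the point is only that each record is either accepted or flagged with the names of the fields that failed.

```python
import re

# Hypothetical rules: field name -> pattern the whole value must match.
RULES = {
    "invoice_no": re.compile(r"INV-\d{6}"),
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "amount": re.compile(r"\d+(\.\d{2})?"),
}


def validate(record):
    """Return the names of fields that are missing or fail their pattern."""
    return [
        name
        for name, pattern in RULES.items()
        if not pattern.fullmatch(str(record.get(name, "")))
    ]


def triage(records):
    """Split records into accepted ones and flagged ones with failure details."""
    accepted, flagged = [], []
    for record in records:
        failures = validate(record)
        if failures:
            flagged.append({"record": record, "failed": failures})
        else:
            accepted.append(record)
    return accepted, flagged
```

Using `fullmatch` rather than `search` ensures the whole value conforms, so a stray suffix such as `INV-000123x` is flagged instead of silently passing.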

Extract JSON Key node

Extract specific values from nested JSON structures within a workflow. Supports deep nesting, wildcard key selection + array handling. Used when ingested files contain complex JSON with required data buried in nested objects or arrays.
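The kind of lookup described above can be sketched in a few lines of Python. The dotted path syntax, the `*` wildcard and the `extract` helper are assumptions made for this example; they are not the Extract JSON Key node's actual configuration format.

```python
import json


def extract(data, path):
    """Return every value matching a dotted path.

    '*' matches any key of an object or any item of an array;
    a numeric part selects a single array index.
    """
    matches = [data]
    for part in path.split("."):
        next_matches = []
        for node in matches:
            if isinstance(node, dict):
                if part == "*":
                    next_matches.extend(node.values())
                elif part in node:
                    next_matches.append(node[part])
            elif isinstance(node, list):
                if part == "*":
                    next_matches.extend(node)
                elif part.isdigit() and int(part) < len(node):
                    next_matches.append(node[int(part)])
        matches = next_matches
    return matches


payload = json.loads(
    '{"site": {"devices": ['
    '{"id": "a1", "reading": {"temp": 21.5}},'
    '{"id": "b2", "reading": {"temp": 19.0}}]}}'
)
temps = extract(payload, "site.devices.*.reading.temp")
first_id = extract(payload, "site.devices.0.id")
```

Returning a list in every case, including an empty list when nothing matches, keeps downstream handling uniform whether the path hits one value or many.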

Merged file + real-time data pipelines

Combine parsed file data with real-time streams in the same workflow. Merge uploaded file data with time-series data, API responses or Primary Table records - for example, combining a daily CSV report with live sensor readings for unified analysis.
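To make the daily-CSV-plus-live-readings example concrete, here is a small Python sketch of the join. The `sensor_id`, `daily_avg` and `value` field names are hypothetical, and the readings are assumed to arrive in chronological order.

```python
import csv
import io


def merge_report_with_readings(csv_text, readings):
    """Attach the most recent live reading to each row of a daily CSV report."""
    latest = {}
    for reading in readings:
        # Later readings overwrite earlier ones (chronological order assumed).
        latest[reading["sensor_id"]] = reading

    merged = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        live = latest.get(row["sensor_id"])
        merged.append({
            "sensor_id": row["sensor_id"],
            "daily_avg": float(row["daily_avg"]),
            # None marks sensors that appear in the report but have no live data.
            "live_value": live["value"] if live else None,
        })
    return merged
```

For example, a report row for sensor `s1` with a daily average of 20.0 and a latest live value of 22.4 would come out as one record carrying both figures.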

File parsing nodes sit in the Data Layer, processing file content after ingestion from the Integration Layer.

Files arrive via FTP, SFTP, S3 or manual upload from the Integration Layer.
Parsing nodes extract, validate + structure file content within the workflow.
Structured output is written to MySQL or Cassandra for storage.
The Execution Layer uses parsed data for workflow logic, AI processing + automated actions.
The Presentation Layer surfaces parsed data in dashboards + reports.

What file types does Rayven parse?

CSV, JSON, XML, plain text, binary, and compressed formats (.zip, .gz). Character encoding is configurable, so non-standard character sets can be handled. Proprietary or non-standard structures can be handled via the JavaScript Node or Advanced Function Node. See Data Transformation.

How does Rayven ingest files for parsing?

Files are ingested via FTP, SFTP, S3, manual upload through a Rayven form, or HTTP POST. The ingestion method is set as the workflow trigger node. See File Uploads.

Can Rayven parse files with variable structures?

Yes. When file schemas vary, the JavaScript Node or Advanced Function Node handles dynamic structure detection and field extraction. This supports legacy report exports where column order or naming is inconsistent. Explore Data Transformation.
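One common way to cope with inconsistent column order or naming is to normalise headers against a table of known aliases. The sketch below shows that approach in Python for illustration; the `ALIASES` table and field names are hypothetical, and in Rayven the equivalent logic would live in the JavaScript Node or Advanced Function Node.

```python
import csv
import io

# Hypothetical alias table: canonical field -> header spellings seen in exports.
ALIASES = {
    "customer": {"customer", "customer name", "client"},
    "total": {"total", "amount", "invoice total"},
}


def canonical(header):
    """Map a raw header to its canonical field name, or None if unknown."""
    key = header.strip().lower().replace("_", " ")
    for name, variants in ALIASES.items():
        if key in variants:
            return name
    return None


def parse_variable_csv(text):
    """Parse CSV text whose column order and naming may differ between files."""
    reader = csv.reader(io.StringIO(text))
    headers = [canonical(h) for h in next(reader)]
    # Columns with unrecognised headers are dropped from the output.
    return [
        {name: value.strip() for name, value in zip(headers, row) if name}
        for row in reader
    ]
```

Because fields are matched by name rather than position, two exports with the same data in a different column order produce identical records.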

How does CSV parsing handle headers?

Rayven's CSV parser can detect headers from the first row or use a manually defined column mapping. Multi-row header structures, quoted fields and custom delimiters are all configurable per ingestion node. See Data Layer configuration options.
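The two header modes can be illustrated with Python's standard `csv.DictReader`, which behaves in a comparable way. This is only an analogy: the `read_rows` helper and its parameters are assumptions for the example, not Rayven settings.

```python
import csv
import io


def read_rows(text, delimiter=",", fieldnames=None):
    """Read CSV text into dictionaries.

    With fieldnames=None the first row is treated as the header;
    otherwise the supplied names are used and every row is data.
    """
    reader = csv.DictReader(
        io.StringIO(text),
        fieldnames=fieldnames,
        delimiter=delimiter,
        quotechar='"',
    )
    return [dict(row) for row in reader]


# Header taken from the first row, semicolon delimiter, quoted field
# containing the delimiter itself.
with_header = read_rows('name;note\n"Smith; J";ok\n', delimiter=";")

# No header row in the file: column names are supplied manually.
without_header = read_rows("Smith|ok\n", delimiter="|", fieldnames=["name", "note"])
```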

Can parsed data feed an AI model directly?

Yes. Parsed file content - including extracted text from documents - can feed directly into an AI/LLM node for classification, extraction or summarisation within the same workflow. See AI Models + Training.

Is there a file size limit for parsing?

There is no hard size limit in workflow configuration. Performance on very large files depends on the complexity of downstream parsing and transformation logic. Contact us for high-volume file processing requirements.

Can Rayven parse multiple files in a single workflow run?

Yes. File ingestion nodes can poll a directory and process all new files found in a single execution cycle. Each file is parsed and passed through the workflow independently within the same run. Explore the Execution Layer.

How are parsing errors handled?

The Error Handler and Conditional Filter nodes route parse failures to alternative paths - flagging the file for manual review, triggering an alert or storing the raw file without transformation. See Notifications + Alerts.
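The routing pattern can be sketched as follows. This is plain Python standing in for the Error Handler and Conditional Filter nodes; the route names `parsed` and `manual_review` are hypothetical, and JSON is used only because it makes parse failures easy to demonstrate.

```python
import json


def route(raw_files):
    """Parse each file and route failures to a review path with the raw content.

    raw_files maps a file name to its text content.
    """
    routes = {"parsed": [], "manual_review": []}
    for name, text in raw_files.items():
        try:
            routes["parsed"].append({"file": name, "data": json.loads(text)})
        except json.JSONDecodeError as exc:
            # Keep the untouched raw text so nothing is lost on failure.
            routes["manual_review"].append(
                {"file": name, "raw": text, "error": exc.msg}
            )
    return routes
```

Storing the raw content next to the error message means a reviewer can inspect and resubmit the file without going back to the original source.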

Can parsed data write directly to a database table?

Yes. Parsed and transformed data can be written to any Rayven Primary or Secondary Table via Push Row nodes. This lets file-based batch ingestion feed the same unified data structure as real-time sources. See Unified Data Tables.

Does Rayven parse data inside compressed archives?

Yes. The file ingestion node can decompress .zip and .gz files and extract individual files for parsing. The structure within the archive is flattened and each file is processed through the workflow pipeline. See the File Uploads page.
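For readers who want to see what decompress-and-flatten means in practice, here is a sketch using Python's standard `zipfile` and `gzip` modules. The `extract_members` helper is hypothetical and only approximates the behaviour described above.

```python
import gzip
import io
import posixpath
import zipfile


def extract_members(name, payload):
    """Return {flat_file_name: bytes} for a .zip or .gz payload.

    Directory structure inside a .zip is flattened to base names, so two
    members with the same base name would collide (the later one wins).
    """
    if name.endswith(".zip"):
        with zipfile.ZipFile(io.BytesIO(payload)) as archive:
            return {
                posixpath.basename(info.filename): archive.read(info)
                for info in archive.infolist()
                if not info.is_dir()
            }
    if name.endswith(".gz"):
        # A .gz file wraps a single file; strip the suffix for its name.
        return {name[:-3]: gzip.decompress(payload)}
    # Anything else is passed through unchanged.
    return {name: payload}
```

Each entry in the returned mapping can then be handed to the appropriate parser independently, which mirrors the per-file processing described in the answer.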

File parsing.

Turn files into data, automatically.

How File Parsing gets used.

Automated invoice processing

Daily report ingestion for a retail BI platform

Partner building a document processing pipeline for a legal firm
