SQL + Cassandra Data Storage | Data Layer

MySQL for structured relational data

All Primary Tables, Secondary Tables + entity metadata are stored in MySQL. Supports relational queries, joins + structured data management. Familiar SQL patterns, managed entirely within Rayven - no separate database administration required.

Cassandra for time-series + event data

All workflow payload data is automatically stored in Cassandra, indexed by UID, Node ID + Timestamp. Optimised for high-frequency writes, horizontal scalability + low-latency time-series reads - purpose-built for IoT telemetry, event streams + operational logs at any scale.

Hybrid architecture - unified access

MySQL and Cassandra coexist within the same platform layer. Dashboard widgets, workflow nodes + API endpoints query both via the same interface - no tool switching, no cross-database ETL, no schema synchronisation overhead.

Azure-hosted, geo-redundant

Both databases are hosted on Microsoft Azure with geo-redundant storage. Daily automated backups (60-day retention), weekly retention (one year) + monthly retention (three years). No single point of failure. Consistent performance regardless of geographic distribution.

Automated backup + retention policies

Daily backups run automatically with 60-day retention by default. Weekly + monthly retention extends historical coverage for compliance + audit. Data Repository node settings configure per-workflow retention at the workflow level.

Instant data availability

Data written to either database is immediately queryable. No refresh cycle, no replication delay, no batch sync. Dashboard auto-refresh operates on live data. API endpoints return current state. AI models train on up-to-date datasets.

The storage layer underpins every other capability in the platform - all data from the Integration Layer is stored here before any downstream use.

Data from the Integration Layer flows into MySQL (structured records) or Cassandra (event + time-series data) immediately on ingestion.
The Execution Layer reads from both stores for workflow logic, AI + automation.
The Presentation Layer queries both for dashboards, reports + custom interfaces.
API endpoints expose data from either store to external systems on demand.

Data storage is invisible to end users - it simply ensures data is always there, always current, always accessible.

Why does Rayven use both MySQL and Cassandra?

MySQL handles structured relational data - entity records, configuration tables, reference data. Cassandra handles high-frequency time-series writes - sensor readings, telemetry, event logs - where MySQL would bottleneck under volume. Each database is used for what it does best. See the full Data Layer.

When does data go into Cassandra vs MySQL?

IoT sensor readings, streaming events and any timestamped series data go into Cassandra. Structured entity records, operational logs with schema joins, and form submission records go into MySQL (Primary and Secondary Tables). See Unified Data Tables.

Can both databases be queried in the same workflow?

Yes. Workflow nodes can pull from Cassandra (time-series reads) and MySQL (table rows) simultaneously and merge the results for downstream transformation, dashboarding or AI model input. Explore Data Transformation.

How is data retention managed across both databases?

MySQL data retention is managed via scheduled deletion or archival workflows. Cassandra TTL (Time To Live) settings auto-expire records after a configured period. Retention windows are configurable per data type and business requirement. See Data Management.

Is my data backed up?

Yes. Both MySQL and Cassandra are backed up on a managed schedule. Backup frequency and retention windows are defined in your service agreement. Contact us for backup and recovery SLA details.

Can I access Rayven databases directly via SQL?

Read-only SQL access to MySQL is available for authorised integrations, enabling external BI tools to query Rayven data directly. For Cassandra, data is accessed via API endpoints or the Rayven workflow layer. See API Endpoints.

How does Cassandra scale for high-frequency IoT deployments?

Cassandra is designed for horizontal scaling - adding nodes increases write and read throughput linearly. Rayven manages this scaling automatically. Thousands of assets writing simultaneously at high frequency is a standard deployment pattern. See IoT Devices + Protocols.

Are there limits on how much data can be stored?

Storage limits depend on your platform tier. Cassandra and MySQL storage scale with your contract. For high-volume storage requirements, contact us to confirm the appropriate configuration.

Can historical data be exported for use in external systems?

Yes. Any stored dataset - MySQL tables or Cassandra series - can be exposed via authenticated API endpoints or written to FTP/S3 via output nodes. Data is not locked into the platform. See the Integration Layer.

What happens to stored data if the platform is offline for maintenance?

Scheduled maintenance windows are communicated in advance. All stored data persists through maintenance periods. Data arriving during downtime is queued and processed on recovery, depending on integration type. Contact us for SLA details.

Give AI agents all your data.

Sectors

Job Roles

Overview

Set Engagements

Overview

We Partner With

Library

Our Company

Platform > Data Layer > Data Storage

SQL + Cassandra data storage.

One platform, two databases - no compromise.

What SQL + Cassandra Data Storage gives you.

MySQL for structured relational data

Cassandra for time-series + event data

Hybrid architecture - unified access

Azure-hosted, geo-redundant

Automated backup + retention policies

Instant data availability

Where SQL + Cassandra Data Storage fits in the Rayven Platform stack.

How SQL + Cassandra Data Storage gets used.

Hybrid data model for an industrial AI platform

Financial services BI platform without a data warehouse

Partner-operated multi-client platform on shared infrastructure

Rayven SQL + Cassandra Data Storage FAQs:

Why does Rayven use both MySQL and Cassandra?

When does data go into Cassandra vs MySQL?

Can both databases be queried in the same workflow?

How is data retention managed across both databases?

Is my data backed up?

Can I access Rayven databases directly via SQL?

How does Cassandra scale for high-frequency IoT deployments?

Are there limits on how much data can be stored?

Can historical data be exported for use in external systems?

What happens to stored data if the platform is offline for maintenance?

Also in the Data Layer:

Unified Data Tables

Data Management

Data Transformation

File Parsing

Calculation + Aggregation

AI Models + Training

Real-time Data Processing

Discover the easy way to do something new.