ReaLLM orchestration and data privacy.
Plix orchestrates ReaLLM - FalconDive's domain-trained, open-source LLM - end-to-end. Your data stays inside. No external model is ever in the loop.
Plix orchestrates ReaLLM - your data stays inside.
One LLM. Fully in-house. ReaLLM authors the SQL, Plix validates every query, ReaLLM analyzes results - all within the FalconDive boundary.
What ReaLLM works on - and what never leaves.
ReaLLM works on your data - but only inside FalconDive. Plix ensures nothing crosses back out.
What ReaLLM sees
- User's natural-language question + database schema
- SQL it generates - validated by Plix before execution
- Full DataFrame returned by the connector
- Written analysis it produces for the user
What stays sealed
- Query result rows - raw or sampled
- Confidential column values from your warehouse
- Database credentials and connection strings
- ReaLLM's generated SQL or analysis text
- Any aggregation, summary, or derivative
Two layers of validation - before any query fires.
SQL Safety Gate
Plix's first defense - invoked by every connector before a query is fired.
- 01
Strip comments
Remove line comments and block comments so they cannot smuggle forbidden keywords.
- 02
Block forbidden verbs
Regex with word boundaries rejects any query containing DELETE, UPDATE, INSERT, DROP, TRUNCATE, ALTER, CREATE, GRANT, REVOKE, MERGE, REPLACE, CALL, EXEC, EXECUTE, or INTO.
- 03
Require safe prefix
Query must start with SELECT, WITH, SHOW, DESCRIBE, or EXPLAIN - anything else is rejected.
- 04
Raise SQLValidationError
On any violation the query never reaches the database. The user receives a clear error.
Per-database connectors
All share BaseConnector and the same SQL safety gate. Each runs against the customer's own warehouse using the customer's own credentials.
Athena
AWS Athena / PyAthena async cursorBigQuery
google.cloud.bigquery / service account JSONPostgres
SQLAlchemy + psycopg2Snowflake
SQLAlchemy + snowflake-connector{ state, result: DataFrame, data_scanned_in_bytes, query_cost, query_id, error } One real request, hop by hop.
"How many active enterprise customers signed up last quarter?" - 01 User to Plix to ReaLLM Natural-language question + table schema Schema and question stay inside FalconDive.
- 02 ReaLLM to Plix Validator Plain SELECT generated by ReaLLM Passes the SQL safety gate - forbidden verbs already rejected.
- 03 Plix Connector to DB Validated SQL executes against the customer warehouse Cost pre-estimated. DataFrame returned to the connector.
- 04 Plix to ReaLLM - analysis Full DataFrame handed back to ReaLLM for domain-aware analysis Data and analysis stay in-house throughout.
- 05 ReaLLM to User DataFrame plus written analysis - trends, outliers, recommended next query Loop closes. No external system was ever in the loop.
Five guarantees you can put in front of audit.
Plix is the audit-ready orchestrator - routing ReaLLM through SQL authoring, validation, and analysis, all in-house.
No external model in the loop
ReaLLM is FalconDive's open-source LLM, retrained on the business domain and hosted in-house. No third-party model ever sees your data.
Read-only by construction
DELETE, UPDATE, INSERT, DROP, TRUNCATE, ALTER, CREATE, GRANT, REVOKE, MERGE, REPLACE, CALL, EXEC, EXECUTE, and INTO are blocked at the gate.
Analysis stays in-house
ReaLLM is open-source, domain-trained, and FalconDive-hosted. The full DataFrame and its analysis never leave the boundary.
Customer-controlled credentials
Each connector authenticates with credentials managed in the customer's environment. ReaLLM never touches them.
Cost and scan transparency
data_scanned_in_bytes and query_cost are returned per query for monitoring and budget controls.