Generative LLM Inference Directly on Encryption — the model never sees the plaintext.
An end-to-end fully-homomorphic-encryption (CKKS) encrypted-transformer stack, paired with a bit-exact, FPGA-validated hardware accelerator, built on a freedom-to-operate-clean, proprietary ring. This is confidential AI inference for regulated data — not blockchain.
Glide lets you run a transformer model over data that stays encrypted end to end. The data is encrypted before it leaves the owner, the computation happens directly on ciphertexts under FHE, and only the data owner can decrypt the result. The server — and VaultBytes — never see the plaintext. The heavy cryptographic operation (the CKKS key-switch and its number-theoretic transforms) is offloaded to a dedicated hardware accelerator so encrypted inference becomes practical, not just possible.
Most of the FHE industry is building confidential blockchain — encrypted smart contracts and on-chain privacy. Glide targets a different problem: encrypted AI inference for regulated data. And unlike software-only or hardware-only efforts, it pairs a working encrypted-LLM stack with a synthesizing accelerator, on a ring chosen to be freedom-to-operate clean against the crowded CKKS patent landscape.
Every result is checked bit-for-bit against a reference. We report what is measured and flag what is not yet built.
End-to-end FHE inference demonstrated on a real transformer — encrypted attention and softmax in the ciphertext domain.
CKKS key-switch engine — forward and inverse NTT both bit-exact in simulation; synthesizes and place-&-routes on FPGA.
Our proprietary ring routes around the CKKS patent thicket — a freedom-to-operate position and an integration head start.
Every result is checked bit-for-bit against a reference. We report what is measured and flag what is not yet built.
Organisations that need to run AI over data they are not allowed to expose in plaintext, even to the inference provider.
| Finance | Inference over customer / transaction data without decrypting it on the server. |
|---|---|
| Healthcare | Models over patient records under strict confidentiality and data-residency rules. |
| Defense & government | Sovereign, confidential AI compute on sensitive data; air-gap-friendly trust model. |
Glide is in active development. The encrypted-LLM stack and the accelerator's NTT engine are validated and bit-exact; full key-switch integration, production silicon, multi-layer end-to-end runs, and production-scale parameters are the next milestones.
Stated plainly: FPGA-validated and bit-exact today; production throughput is currently batch-oriented and silicon-gated. Patent posture is freedom-to-operate clean with a small set of candidate filings pending counsel — not granted patents. We would rather under-claim and show the receipts.
For NDA evaluations, design partnerships, and pilots on encrypted inference. Tell us the model, the regulated data, and the timeline — we reply within one business day.