Canuckt AI

The Intelligence Behind Canadian Privacy

We built the AI infrastructure that powers Shielk and Valdra — a proprietary Canadian NER engine, purpose-built compliance agents, and a legal knowledge base that covers all 34 Canadian privacy laws.

Shielk
Security by Canuckt
AI Privacy Proxy

Powered by Canuckt AI

Valdra
Compliance by Canuckt
Compliance Automation

What Canuckt AI Does

Purpose-Built for Canadian Privacy

Every component of Canuckt AI was built specifically for Canadian law — not adapted from US tools.

Canadian PII Detection

A proprietary NER engine trained exclusively on Canadian legal and regulatory documents. Together with rule-based recognizers, it detects 421+ Canadian PII types, many of which off-the-shelf models miss.

Bilingual Processing

Native English and French support. Quebec French-specific patterns — RAMQ, NAM, Quebec court numbers, Law 25 terminology. Not a translation layer.
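As a rough illustration of a Quebec-specific pattern, the sketch below looks for candidate health insurance numbers (RAMQ/NAM) and checks for nearby English or French context cues. It assumes the commonly documented layout of four letters followed by eight digits. The cue list, the function name `find_ramq_candidates`, and the window size are hypothetical and are not taken from Canuckt's recognizers.

```python
import re

# Assumed layout: 4 letters + 8 digits, optionally grouped as XXXX 0000 0000.
RAMQ_PATTERN = re.compile(r"\b[A-Za-z]{4}[ -]?\d{4}[ -]?\d{4}\b")

# Hypothetical context cues in English and French (whole words only,
# so "nam" does not fire on "name").
CUE_PATTERN = re.compile(
    r"\b(ramq|nam|num[ée]ro d'assurance maladie|health insurance number)\b",
    re.IGNORECASE,
)

def find_ramq_candidates(text: str, window: int = 60) -> list[dict]:
    """Return candidate matches, flagged by whether a context cue is nearby."""
    hits = []
    for m in RAMQ_PATTERN.finditer(text):
        context = text[max(0, m.start() - window): m.end() + window]
        hits.append({
            "text": m.group(0),
            "start": m.start(),
            "end": m.end(),
            "has_context": bool(CUE_PATTERN.search(context)),
        })
    return hits
```

A match without a context cue could be treated as lower confidence rather than discarded. That choice is a design assumption of this sketch.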

Legal Knowledge Base

A continuously updated knowledge base covering 34 Canadian privacy laws and regulations, including PIPEDA, the privacy acts of all 13 provinces and territories, CASL, and FINTRAC requirements, with section-level citations.

Compliance Agents

8 specialized AI agents that understand Canadian law: document drafting, breach triage, vendor risk, gap analysis, regulatory monitoring, and more.

Compliance Certification

Auto-generated PIPEDA compliance certificates with section-level citations. Maps every detection to the 10 Fair Information Principles.

Zero-Knowledge Architecture

All PII processed in RAM and purged in under 2 seconds. Zero data retention. Zero training on your data. Canadian-controlled infrastructure.
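The in-memory idea can be sketched as a buffer that is overwritten as soon as processing ends. This is only an illustration under stated assumptions: the helper names `ephemeral_buffer` and `count_digits` are hypothetical, and plain Python cannot guarantee that no other copy of the data exists (for example, the immutable `bytes` object passed in by the caller).

```python
from contextlib import contextmanager

@contextmanager
def ephemeral_buffer(payload: bytes):
    """Hold a payload in a mutable buffer and zero it when the block exits."""
    buf = bytearray(payload)
    try:
        yield buf
    finally:
        # Overwrite in place so the contents do not linger in this buffer.
        buf[:] = bytes(len(buf))

def count_digits(payload: bytes) -> int:
    """Example workload: count ASCII digits, then let the buffer be wiped."""
    with ephemeral_buffer(payload) as buf:
        return sum(1 for b in buf if 48 <= b <= 57)
```

The `finally` clause runs even if processing raises, so the buffer is wiped on both success and failure paths.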

The Results

Performance That Speaks for Itself

Benchmarked against leading off-the-shelf NLP models on held-out Canadian PII test sets.

421+ Canadian PII Types: covering every province and territory
34 Privacy Laws: federal plus all 13 provinces and territories
95%+ Detection Accuracy: on Canadian PII test sets
0 Bytes Stored: purged in under 2 seconds

Under the Hood

GPU-Trained on Canadian Regulatory Data

We didn't take a general-purpose model off the shelf. We built a custom NER pipeline, fine-tuning a transformer exclusively on Canadian legal and regulatory documents: IRCC forms, OPC decisions, PIPEDA rulings, and Law 25 guidance.

Step 01: Corpus Collection

31 real Canadian regulatory documents: IRCC forms, OPC enforcement cases, PIPEDA decisions, CAI rulings, Law 25 guidance. No synthetic data.

Step 02: Human Annotation

1,491 entities manually labelled across 244 Canadian PII types — SIN, RAMQ, NAM, passport numbers, work permit IDs, provincial health card numbers.
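Manually labelled entities are typically stored as character spans and converted to per-token tags before training. The sketch below shows one common convention (BIO tagging) with a whitespace tokenizer. The tokenizer, the label name `SIN`, and the function names are assumptions for illustration; the source does not describe Canuckt's annotation format.

```python
import re

def tokenize_with_offsets(text):
    """Whitespace tokenizer returning (token, start, end) triples."""
    return [(m.group(0), m.start(), m.end()) for m in re.finditer(r"\S+", text)]

def spans_to_bio(tokens, spans):
    """Convert (start, end, label) character spans to one BIO tag per token."""
    tags = ["O"] * len(tokens)
    for span_start, span_end, label in spans:
        inside = False
        for i, (_, tok_start, tok_end) in enumerate(tokens):
            # A token belongs to the entity if it lies fully inside the span.
            if tok_start >= span_start and tok_end <= span_end:
                tags[i] = ("I-" if inside else "B-") + label
                inside = True
    return tags

text = "SIN 046 454 286 on file"
tokens = tokenize_with_offsets(text)
tags = spans_to_bio(tokens, [(4, 15, "SIN")])
```

Here the three number groups receive `B-SIN`, `I-SIN`, `I-SIN`, and every other token is tagged `O`.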

Step 03: GPU Training

Transformer-based NER fine-tuned on CUDA-accelerated compute. Custom inference pipeline handles bilingual entities, partial redactions, and contextual disambiguation.

Step 04: Production Inference

Containerized GPU inference on Canadian servers. Sub-200ms median response. Results purged from RAM on request completion. Zero data retention.

Benchmark: Canadian PII Detection (F1)

Model                   F1 Score   Data Jurisdiction
spaCy en_core_web_lg    ~40%       N/A (generic)
GPT-4 API               ~65%       US (OpenAI)
AWS Comprehend          ~55%       US (Amazon)
Canuckt NER v3          92.4%      🍁 Canada

NER model F1 measured on held-out Canadian PII test set. Off-the-shelf scores are Canuckt internal benchmarks on the same test set. End-to-end product detection accuracy (NER + rule-based recognizers): 95%+.
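For readers unfamiliar with the metric, F1 is the harmonic mean of precision and recall. The sketch below computes it at the entity level with exact-match scoring, where a prediction counts only if its start, end, and label all match a gold entity. Exact matching is an assumption here; the source does not state which matching rule the benchmark used.

```python
def entity_f1(gold: set, predicted: set) -> dict:
    """Exact-match precision, recall, and F1 over (start, end, label) tuples."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}

# Illustrative spans only, not benchmark data.
gold = {(0, 11, "SIN"), (20, 32, "RAMQ"), (40, 48, "PASSPORT")}
predicted = {(0, 11, "SIN"), (20, 32, "RAMQ"), (50, 55, "SIN")}
scores = entity_f1(gold, predicted)
```

In this toy case two of three predictions are correct and two of three gold entities are found, so precision, recall, and F1 are all two thirds.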

Technology Stack
Python 3.11 · PyTorch · Transformers · spaCy · CUDA · Hugging Face · FastAPI · Docker · PostgreSQL · NumPy · scikit-learn · Ubuntu 22.04
92.4% NER Model F1 · 421+ Total Recognizers · <200ms Inference
NVIDIA Inception Program Member · Powered by NVIDIA GPU infrastructure
Current Production Model
Canuckt NER v3 — Canadian Privacy Edition

Transformer architecture fine-tuned on a proprietary corpus of Canadian legal and regulatory documents. NER model covers 244 entity types; combined with rule-based recognizers the full detection system reaches 421+ Canadian PII types. Covers PIPEDA, Law 25, PHIPA, PIPA, FINTRAC, IRCC document types, and bilingual Quebec-specific patterns.
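One way a rule-based recognizer can complement an NER model is checksum validation. A Social Insurance Number is nine digits that pass the Luhn check, so a regular expression plus a checksum can confirm candidates, and the confirmed spans can then be merged with model output. The sketch below is an illustration only: the merge policy (rule-based spans win on overlap) and all function names are assumptions, not Canuckt's pipeline.

```python
import re

SIN_PATTERN = re.compile(r"\b\d{3}[ -]?\d{3}[ -]?\d{3}\b")

def luhn_valid(digits: str) -> bool:
    """Luhn checksum: double every second digit from the right."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0

def find_sins(text: str) -> list:
    """Return (start, end, 'SIN') for nine-digit candidates that pass Luhn."""
    hits = []
    for m in SIN_PATTERN.finditer(text):
        if luhn_valid(re.sub(r"\D", "", m.group(0))):
            hits.append((m.start(), m.end(), "SIN"))
    return hits

def merge_detections(ner_spans: list, rule_spans: list) -> list:
    """Union of spans; on overlap, keep the checksum-verified rule span."""
    merged = list(rule_spans)
    for span in ner_spans:
        if not any(span[0] < r[1] and r[0] < span[1] for r in rule_spans):
            merged.append(span)
    return sorted(merged)
```

The widely used test value `046 454 286` passes the check, while `123 456 789` does not, so only the former would be reported.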

NER Model F1 Score: 92.4%
Total Recognizers: 421+
NER Entity Types: 244
Model Version: v3

8 Compliance Agents

AI That Knows Canadian Law

Every agent is powered by Canuckt AI with a proprietary Canadian legal knowledge base. Every answer cites the actual regulation. Every document is legally accurate.

Agent 1

Valdra Copilot

Answers any Canadian compliance question in context of your profile. Cites PIPEDA Schedule 1, Law 25 sections, CASL articles.

Canuckt AI powered
Agent 2

Document Drafter

Generates Privacy Policies, PIAs, DPAs, CASL consent forms, OPC breach reports — bilingual, legally accurate, tailored to your org.

Agent 3

Breach Triage

Conversational real risk of significant harm (RROSH) assessment. Knows the differences between PIPEDA and Law 25. Starts OPC and CAI countdown clocks automatically.

Agent 4

PII Scanner

Leverages Shielk's full detection engine. Auto-populates your Data Inventory from scan results. 100% local — zero data leaves Canada.

Agent 5

Vendor Risk

Reviews vendor SOC 2 reports, flags CLOUD Act exposure for US vendors, generates and sends DPAs automatically.

Agent 6

Questionnaire Responder

Answers inbound security questionnaires from enterprise customers. 70-80% auto-answer rate from your compliance profile.

Agent 7

Regulatory Watch

Monitors OPC, CAI, CRTC, and Parliament daily. Summarizes changes in plain language. Alerts you when your org is affected.

Agent 8

Gap Analyzer

Continuous gap analysis after every profile change. Prioritized output with fix time estimates and fine risk calculations.


All agents operate on Canadian servers. Legal knowledge base covers PIPEDA, Law 25, CASL, OPC guidelines, CAI rulings — continuously updated.

One Engine. Two Products.

Powers Both Shielk and Valdra

The same Canuckt AI engine drives both products — but they solve completely different problems.

Shielk
Security by Canuckt

AI Privacy Proxy — anonymize PII before any AI, de-anonymize after. PIPEDA compliance certificates. Zero data stored.
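The anonymize-then-restore flow can be sketched as placeholder substitution with a mapping kept on the proxy side. The placeholder format, the function names, and the assumption that detected spans do not overlap are all illustrative choices; the source does not describe how Shielk implements this step.

```python
def anonymize(text: str, spans: list) -> tuple:
    """Replace each (start, end, label) span with a numbered placeholder."""
    mapping = {}
    counters = {}
    parts = []
    cursor = 0
    for start, end, label in sorted(spans):  # assumes non-overlapping spans
        counters[label] = counters.get(label, 0) + 1
        placeholder = f"<{label}_{counters[label]}>"
        mapping[placeholder] = text[start:end]
        parts.append(text[cursor:start])
        parts.append(placeholder)
        cursor = end
    parts.append(text[cursor:])
    return "".join(parts), mapping

def deanonymize(text: str, mapping: dict) -> str:
    """Restore original values wherever a placeholder appears."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text

prompt = "Client SIN 046 454 286 needs a summary."
masked, mapping = anonymize(prompt, [(11, 22, "SIN")])
restored = deanonymize("Summary prepared for <SIN_1>.", mapping)
```

Only `masked` would be sent to the external AI service. The mapping stays with the proxy and is applied to the response on the way back.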

Valdra
Compliance by Canuckt

Compliance automation — PIPEDA assessments, AI document generation, breach autopilot, CASL management, vendor risk tracking.

Both powered by Canuckt AI · 100% Canadian 🍁

🍁 100% Canadian Infrastructure

Built in Canada. For Canada.

Interested in partnering, licensing our detection engine, or building on the Canuckt AI platform? We'd like to hear from you.

Nova Scotia, Canada · Data never leaves Canadian borders
