Versioned, audit-ready datasets designed for AI evaluation, regression testing, and security operations. Each snapshot is a frozen slice of reality — so if your model regresses, you can prove it.
"This snapshot will never change." Buyers can reproduce results, compare models over time, and separate data drift from model regressions.
Delivery: private S3 (signed links), or your preferred secure method • License: internal use / no redistribution
Immutable release containing all CISA KEV entries (as of snapshot date) plus validated non-KEV signals published during the month. Includes HTML reports, interactive visualizations, and a sample folder for evaluation.
Typical pricing: $500–$5,000 / snapshot (starter buyers)
Includes: JSONL, Parquet, JSON, HTML reports, visualizations, manifest, and sample folder (25-50 representative records)
Complete CISA Known Exploited Vulnerabilities catalog with normalization, NVD enrichment (CVSS scores, severity), and confidence scoring. All KEV entries as of the snapshot generation date.
Typical pricing: bundled or $250–$2,000 one-time
Note: KEV entries are included in monthly snapshots based on their status as of snapshot date, regardless of original publication date.
Rolling 30-day window of newly published non‑KEV CISA advisories from RSS feeds and web scraping. Excludes KEV entries (which appear only in full snapshots). May be quiet during low-activity periods.
Typical pricing: $50–$500 / month (subscription)
Sources: CISA RSS feeds + web-scraped advisory pages
Not just data — reproducibility, trust, and time savings. Each snapshot is a release artifact you can reference forever. That enables objective model comparisons, audit trails, and defensible decisions.
100% confidence indicates confirmed exploitation (KEV-backed ground truth). It does not mean highest impact, prevalence, or likelihood in a specific environment.
Each snapshot is delivered as a versioned folder with immutable artifacts:
The manifest includes record counts, coverage window, KEV status timestamp, and generation metadata. HTML reports provide interactive exploration. The sample folder contains 25-50 representative records for evaluation.
What you get: Not just raw data—you get normalized, enriched, scored, and documented datasets with HTML reports and visualizations ready for immediate use.
Because we don't pad the feed with UI pages, social links, or speculative content. Quiet periods are normal for authoritative sources like CISA and are a sign of signal integrity. We prioritize quality over quantity.
Free CISA feeds are raw, unprocessed, and constantly changing. Our snapshots are normalized (consistent schema, cleaned HTML, extracted CVEs), enriched (NVD CVSS scores, severity ratings), scored (explainable confidence factors), immutable (never change after release), and documented (HTML reports, visualizations, manifests). You're buying processed, reproducible data ready for ML/AI pipelines, not raw feeds.
Snapshots aren't for pretraining from scratch. They're for supervision, evaluation, continual updates, and audit-grade regression testing. Snapshots compound into a trusted corpus over time.
No. Snapshots are immutable. If corrections are needed, a new version (e.g., v2) may be issued while preserving prior versions for reproducibility.
Typically no. The license is internal-use and prohibits redistribution of raw records. Sharing derived outputs (models, analyses, reports) is allowed if it does not enable reconstruction of the dataset. For redistribution rights, contact us for custom licensing terms.
Each snapshot includes interactive HTML reports with searchable tables, charts showing source distribution, severity breakdown, CVSS scores, confidence scores, timeline analysis, and complete schema documentation. The visualizations help you understand the dataset structure and data quality before writing code.
Snapshots are generated monthly, typically within the first few days after the month ends to ensure completeness. Each snapshot includes all KEV entries as of the generation date (regardless of when they were originally published) plus all non-KEV signals published during that calendar month. The snapshot is then frozen and never modified.
Tell us what you're building and we'll recommend the right SKU (snapshot vs feed) and provide a sample.
Email: awarenesssoftwaregroup@gmail.com
Preferred info to include: use case (ML eval, detection engineering, vulnerability management), desired cadence (monthly snapshots vs. rolling feed), team size, and whether you need Parquet format.