Building an AI Threat Analytics Framework: A Developer's Journey

carocsteads
Dec 12, 2025
4 min read

Updated: Dec 16, 2025

How I created a security-focused AI testing framework with Python, pytest, and real-world threat detection

By Carolina Steadham | QA Automation Engineer

Introduction

Security is one of the most critical concerns in today's digital landscape. With AI systems becoming increasingly prevalent in security operations, I set out to build a comprehensive AI Threat Analytics Framework - a proof-of-concept project that demonstrates how AI can be leveraged for threat detection, classification, and analysis.

This project showcases not just the potential of AI in cybersecurity but also demonstrates professional software engineering practices: clean code structure, comprehensive testing, and thorough documentation.

What I Built

The AI Threat Analytics Framework is a Python-based testing and demonstration platform that includes:

🤖 AI-Powered Features:

Autofill Service - Intelligent email suggestion generation
Threat Classifier - Keyword-based detection for phishing, malware, and spam
LLM Guardrails - Security controls to prevent prompt injection and filter PII
Report Summarizer - Automated text summarization for security reports

📊 Data Pipeline Components:

Anomaly Detection - Statistical z-score analysis to identify unusual patterns
Data Validation - Quality checks and filtering for data integrity
ML Pipeline Integration - End-to-end workflow from raw data to predictions

The Technical Stack

I kept the technology stack focused and practical:

Python 3.9+ - Core programming language
pytest ecosystem - Testing framework with HTML reporting, visual indicators, and enhanced output
Regular Expressions - Pattern matching for security threat detection
Statistics Module - Mathematical foundation for anomaly detection
No complex ML frameworks needed - just clean Python and smart algorithms.

Key Features That Make This Project Stand Out

1. Real Logic, Not Just Mocks

Unlike many tutorial projects, this framework uses actual working algorithms:

The anomaly detector uses genuine z-score statistical analysis
The threat classifier employs real keyword pattern matching
The guardrails use regex patterns to detect actual security threats

2. Testing Practices

I implemented 7 comprehensive tests covering:

4 AI/ML functionality tests
3 data pipeline integration tests
100% test coverage with detailed documentation

Each test includes:

Clear test cases with expected results
Real-world application examples
Complete documentation in markdown

3. Documentation

The project includes:

Test Plan - Complete testing strategy and approach
Test Cases - Detailed specifications with code examples
Traceability Matrix - Requirements mapped to test coverage
Setup Guide - Installation and configuration instructions

Real-World Applications

While this is a proof-of-concept, the framework demonstrates techniques applicable to:

✅ Email Security Screening - Detect phishing attempts before they reach users

✅ User Input Validation - Prevent prompt injection in AI chatbots

✅ Threat Pattern Detection - Identify malicious behavior patterns

✅ Security Report Automation - Summarize threat intelligence reports

✅ Anomaly Detection - Flag unusual user behavior or system activity

Example: How Anomaly Detection Works

Let me walk you through one of the coolest features - the statistical anomaly detector:

The Scenario: You have user login times throughout the day. Most logins happen between 9am-5pm. Suddenly, there's a login at 3am.

The Algorithm:

# Normal login times (in hours): 9, 9, 10, 12, 14, 16, 17

# Suspicious login: 3 (3am)

data = [9, 9, 10, 12, 14, 16, 17, 3]

anomalies = detect_anomalies(data, threshold=2.0)

# Result: Detects index 7 (the 3am login) as an anomaly

# Z-score: 2.8 (above our 2.0 threshold)

Real Impact:

This simple algorithm could flag unauthorized access, compromised accounts, or suspicious behavior patterns - all with basic statistics!

Lessons Learned

Keep It Simple

I could have used TensorFlow, PyTorch, or other heavy ML frameworks. Instead, I focused on clean Python and well-understood algorithms. The result? Fast, reliable, and easy to understand.

Documentation Matters

Writing comprehensive test documentation wasn't just busywork - it made the project presentable and helped me think through edge cases.

Testing Is Development

I didn't write tests after the code - I developed tests as the features. This test-driven approach caught bugs early and kept the codebase clean.

Real > Mock

Instead of faking everything with mocks, I implemented actual logic. It's more work upfront, but the result is a portfolio piece that actually does something.

The Numbers

📊 Project Stats:

7 working tests - 100% passing

5 core modules - Clean, documented code

4 documentation files - Professional-grade specs

~500 lines of production code - Quality over quantity

0 external API dependencies - Runs completely offline

⚡ Performance:

All tests complete in under 3 seconds

Anomaly detection processes 1000+ data points instantly

Zero network latency (all local processing)

What's Next? Keep Building!

This project is a foundation, not a finish line. Here are ideas to take it further:

🚀 Enhancement Ideas:

Add Real AI Models

Integrate with OpenAI, Anthropic, or Hugging Face APIs

Implement actual neural networks for classification

Add sentiment analysis to threat reports

Build a Web Interface

Create a Flask/FastAPI dashboard

Real-time threat monitoring visualization

Interactive test execution

Expand Detection Capabilities

Add more threat categories (ransomware, trojans, worms)

Implement multi-language support

Create custom ML models trained on security data

Production Hardening

Add comprehensive error handling

Implement logging and monitoring

Create CI/CD pipeline with GitHub Actions

Add performance benchmarking

Enterprise Features

Multi-user support with authentication

Database integration for threat history

Alerting and notification system

Export reports to PDF/CSV

Why This Project Matters

In a world where AI security is becoming critical, understanding how to build, test, and validate AI systems is invaluable. This project demonstrates:

✅ Practical AI application in security

✅ Professional software engineering practices

✅ Clear documentation and testing methodology

✅ Real algorithms solving actual problems

Whether you're a developer looking to break into AI security, a student building a portfolio, or a professional exploring new domains - this project shows that you don't need massive budgets or teams to build meaningful AI applications.

Get Started

The complete project is available with:

Full source code

Comprehensive documentation

Step-by-step setup guide

Working test suite

Requirements:

Python 3.9+

10 minutes for setup

No API keys or external services needed

Quick Start:

# Clone and setup

python3 -m venv venv

source venv/bin/activate

pip install -r requirements.txt

# Run all tests

pytest tests_ai/ tests_pipelines/ -v --emoji

# Generate beautiful HTML report

pytest --html=reports/test_results.html

Final Thoughts

Building this AI Threat Analytics Framework taught me that effective security tools don't have to be complex. Sometimes, the right algorithm with clean implementation beats an overengineered solution every time.

The intersection of AI, QA, and cybersecurity is growing rapidly. Whether you're interested in:

Machine learning

Security engineering

DevOps and testing

Software architecture

...there's something here to learn and build upon.

See the complete implementation with comprehensive guides.

View on GitHub →

Author: Carolina Steadham

Role: QA Automation Engineer

Date: December 2025

*Ready to discuss how automated testing can strengthen your security posture? Connect with me on [GitHub](https://github.com/steadhac)!*