Three‑Path 3D Network: Fast Video Crime Spotting Made Simple

A new lightweight TriPath3DNet architecture transforms brief security footage into near‑instant, highly accurate threat alerts.
Its core is a three‑layer ResNet that processes video through three distinct lenses:

Motion bursts – short, rapid changes in pixel intensity
Long‑term background clues – evolving scene context over time
Quick frame differences – pixel‑by‑pixel changes between successive frames

These complementary views enable the system to spot subtle anomalies—such as loitering or unauthorized entry—even amid lighting shifts and occlusions.

Performance Highlights

Dataset	Accuracy	AUC
Virat1‑RC	95 %	0.99
Virat2‑RC	94 %	0.98
UCF‑Crime	Outperformed transformer rivals

Inference speed: 130 ms per 50‑frame clip on a standard GPU
Model size: ~33 million parameters
Edge readiness: Designed for fast, low‑memory deployment on real‑world cameras

Why It Works

Three‑path synergy: Ablation studies confirm every path contributes unique value.
Spatial–temporal focus: Grad‑CAM visualizations show the network attends to relevant areas across both space and time.
Real‑world suitability: Built with factory, mall, and public‑space cameras in mind—fast processing, minimal memory footprint, edge compatibility.

TriPath3DNet demonstrates that a lightweight, multi‑view approach can deliver superior detection rates in challenging datasets, bringing practical real‑time surveillance closer to everyday deployment.

Performance Highlights

Why It Works

Actions