Anomaly Detection System

for National Gallery Singapore

in association with

Anomaly Detection System

Our Product

Team Vectors built an anomaly detection system to augment and strengthen the existing security system at National Gallery of Singapore. This system aims to track abnormal user trajectory pattern and act as an alert system to pin down possible misadventure or criminal activity inside the gallery.

Understanding data

Exploratory Data Analysis

In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

Image
Image
Image
Image
Image
Image
Image
Image
Image

Get notified on each updates.

Implementation

Machine Learning Algorithms

Anomaly is be defined as any deviation from standard pattern. We have implemented K-Means and GMM machine learning algorithms to detect cluster outliers in order to find out possible anomalies.

Image
01.

K-Means Clustering

K-means algorithm identifies k number of centroids, and then allocates every data point to the nearest cluster, while keeping the centroids as small as possible.

The ‘means’ in the K-means refers to averaging of the data; that is, finding the centroid. Every data point is allocated to each of the clusters through reducing the in-cluster sum of squares.

To process the learning data, the K-means algorithm in data mining starts with a first group of randomly selected centroids, which are used as the beginning points for every cluster, and then performs iterative (repetitive) calculations to optimize the positions of the centroids.

View Details


View Evaluation Results


Silhouette Coefficient CH Index Davies Bouldin Index
0.3462 271511.42 1.0021
Model Accuracy Precision Recall F1 Score
Model for Level B1 0.8810 0.9023 0.8810 0.8784
Model for Level L1 0.7260 0.8210 0.7260 0.7041
Model for Level L5 0.8085 0.9197 0.8085 0.7418

View Visualizations and Results



1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter Plot
Accuracy Precision Recall F1 Score
0.8 0.86 0.8 0.79


1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter Plot


1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter plot
Accuracy Precision Recall F1 Score
0.66 0.79 0.66 0.61

1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter Plot


1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter Plot
Accuracy Precision Recall F1 Score
0.76 0.96 0.76 0.66

1 / 6
2 / 6
3 / 6
4 / 6
5 / 6
6 / 6

Scatter with marginal plot
Hexbin with marginal plot
2D countour with marginal plot
Gaussian KDE plot
Density plot
3D Scatter Plot
Image
02.

Gausian Mixture Models

A Gaussian mixture model is a probabilistic model that assumes all the data points are generated from a mixture of a finite number of Gaussian distributions with unknown parameters.

One can think of mixture models as generalizing k-means clustering to incorporate information about the covariance structure of the data as well as the centers of the latent Gaussians.

View Details


View Visualizations and Results



1 / 8
2 / 8
3 / 8
4 / 7
5 / 7
6 / 7
7 / 7

Histogram
Histogram
Histogram
1 Gaussian component
5 Gaussian components
18 Gassian components
3D scatter plot


1 / 3
2 / 3
3 / 3

AIC
Histogram/Log Likelihood
Anomalies


1 / 5
2 / 5
3 / 5
4 / 5
5 / 5

AIC
AIC
Histogram
Anomalies
Anomalies


1 / 3
2 / 3
3 / 3

AIC
Histogram/Log Likelihood
Anomalies
Image

Team Vectors

About Us

We are Masters student at University of Southern California majoring in Data Informatics. Vectors tells users about its product by describing how machine learning algorithms were used to detect anomalies. The application breaks down different pieces of the project into sections that combine research and implementation, painting a picture in the form of visualizations.

Team

Leadership

Image

Nadir Fathi

CEO, Kiana Analytics

Serial entrepreneur, CEO, and speaker. Leading IoT and Big Data Revolution

nader128@yahoo.com

Image

Anna Farzindar

Professor, USC

Faculty @USC Viterbi Engineering

farzinda@usc.edu

Image

Ashwini Giri

Team Leader

An inquisitive energetic Data Informatics Master's student at USC with a strong foundation in Data Analysis and Python.

agiri@usc.edu

Image

Kathakali Banerjee

Team Member

Data Science/ Android Application/Digital Marketing

kathakab@usc.edu

Image

Laksh Matai

Team Member

Research Assistant at USC CAIS

lmatai@usc.edu

Image

Pallavi Yenigalla

Team Member

Master of Science, Data Informatics, University of Southern California, Los Angeles

yenigall@usc.edu


Team Leader

Ashwini Giri

Contact Form

Get In Touch

Contact Form