Temporal Anomaly Localization in Video
Date: 2021-10-14
Author: Öztürk, Halil İbrahim
Open access
Abstract
Detecting anomalies in surveillance videos is an important research problem in computer vision. In this thesis, we propose two deep network architectures for anomaly detection: Anomaly Detection Network (ADNet) and Anomaly Detection Network by Object Relations (ADOR). ADNet uses temporal convolutions to localize anomalies in videos. The model works online by accepting consecutive windows of video clips; features extracted from the video clips in a window are fed to ADNet, which allows it to localize anomalies effectively. We propose the AD Loss function to improve ADNet's detection of abnormal segments. ADOR employs an object detector and a spatio-temporal feature extractor to fuse object relations with action information; the fusion is achieved with cross-attention layers that use attention memory from cross encoders. Additionally, we propose the F1@k metric for temporal anomaly detection. Segment-based F1@k is a better evaluation metric than frame-based AUC because it does not penalize minor shifts in temporal segments, while it does penalize short false-positive segment predictions. Furthermore, we extend the UCF Crime dataset by adding two more anomaly classes and providing temporal anomaly annotations for all classes. Finally, we thoroughly evaluate our models on the extended UCF Crime dataset, where ADNet and ADOR produce promising results according to the F1@k metric.
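As a rough illustration of the segment-based metric described above, the following Python sketch computes F1@k by greedily matching predicted anomaly segments to ground-truth segments at an IoU threshold k, in the spirit of segmental F1 scores used for temporal action segmentation. The helper names (segments_from_frames, f1_at_k) and the exact matching rule are assumptions for illustration and may differ from the thesis's precise definition.

import numpy as np

def segments_from_frames(labels):
    # Convert a per-frame binary anomaly sequence into (start, end) segments.
    # `end` is exclusive. Hypothetical helper, not from the thesis.
    segments, start = [], None
    for i, v in enumerate(labels):
        if v and start is None:
            start = i
        elif not v and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(labels)))
    return segments

def f1_at_k(pred_segments, gt_segments, k=0.5):
    # Segment-level F1 at IoU threshold k: a predicted segment counts as a
    # true positive if its IoU with a still-unmatched ground-truth segment >= k.
    matched, tp = set(), 0
    for ps, pe in pred_segments:
        best_iou, best_j = 0.0, None
        for j, (gs, ge) in enumerate(gt_segments):
            if j in matched:
                continue
            inter = max(0, min(pe, ge) - max(ps, gs))
            union = (pe - ps) + (ge - gs) - inter
            iou = inter / union if union > 0 else 0.0
            if iou > best_iou:
                best_iou, best_j = iou, j
        if best_j is not None and best_iou >= k:
            tp += 1
            matched.add(best_j)
    fp = len(pred_segments) - tp
    fn = len(gt_segments) - tp
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return 2 * precision * recall / (precision + recall) if precision + recall else 0.0

# Example: a prediction shifted by a few frames still counts as a true positive
# at k = 0.5, while a short spurious segment is counted as a false positive.
gt = segments_from_frames([0]*10 + [1]*30 + [0]*60)
pred = segments_from_frames([0]*13 + [1]*30 + [0]*20 + [1]*3 + [0]*34)
print(round(f1_at_k(pred, gt, k=0.5), 3))

In this toy example the shifted prediction is matched (IoU ≈ 0.82), the three-frame spurious segment is a false positive, and the resulting F1@0.5 is about 0.667, which reflects the behaviour the abstract attributes to segment-based evaluation.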