IEEE Access, vol.12, pp.84548-84555, 2024 (SCI-Expanded)
CCTV (Closed Circuit Television) surveillance systems, long associated with physical security, are becoming more crucial when combined with cybersecurity measures. Combining traditional surveillance with cyber defences is a flexible method of protecting against both physical and digital dangers. This research introduces the use of Convolutional Neural Networks (CNNs) and hand gesture detection using CCTV data to perform real-time security risk assessments. The suggested method’s emphasis on automated extraction of key information, such as identity and behaviour, illustrates its special use in silent or acoustically challenging settings. This study uses deep learning techniques to develop a novel approach for detecting hand gestures in CCTV images by automatically extracting relevant features using a Media-Pipe architecture. In noisy environments or when the audio stream is muted, for example, it enables risk assessment via the use of hand gestures. Given the uniqueness and efficiency of this method, the suggested solution will be able to alert appropriate authorities in the case of a security breach. There seems to be considerable opportunity for the development of applications in several domains of security, law enforcement, and public safety, including but not limited to shopping malls, educational institutions, transportation, the armed forces, theft, abduction, etc.