Combination of fast hybrid classification and k value optimization in k-nn for video face recognition


  • Nuning Septiana Institut Teknologi Sepuluh Nopember, Surabaya
  • Nanik Suciati Institut Teknologi Sepuluh Nopember, Surabaya



face recognition, Fast Hybrid Classification, k-NN, video


Nowadays, the need for face recognition is no longer include images only but also videos. However, there are some challenges associated with the addition of this new technique such as how to determine the right pre-processing, feature extraction, and classification methods to obtain excellent performance. Although nowadays the k-Nearest Neighbor (k-NN) is widely used, high computational costs due to numerous features of the dataset and large amount of training data makes adequate processing difficult. Several studies have been conducted to improve the performance of k-NN using the FHC (Fast Hybrid Classification) method by optimizing the local k values. One of the disadvantages of the FHC Method is that the k value used is still in the default form. Therefore, this research proposes the use of k-NN value optimization methods in FHC, thereby, increasing its accuracy. The Fast Hybrid Classification which combines the k-means clustering with k-NN, groups the training data into several prototypes called TLDS (Two Level Data Structure). Furthermore, two classification levels are applied to label test data, with the first used to determine the n number of prototypes with the same class in the test data. The second classification using the optimized k value in the k-NN method, is employed to sharpen the accuracy, when the same number of prototypes does not reach n. The evaluation results show that this method provides 86% accuracy and time performance of 3.3 seconds.

Author Biographies

Nuning Septiana, Institut Teknologi Sepuluh Nopember, Surabaya

Department of Informatic Engineering

Nanik Suciati, Institut Teknologi Sepuluh Nopember, Surabaya

Department of Informatic Engineering


H. Idrees, M. Shah and R. Surette, "Enhancing camera surveillance using computer vision: a research note," Policing: An International Journal, vol. 41, no. 2, pp. 292-307, 2018.

M. H. Selamat and H. M. Rais, "Image face recognition using Hybrid Multiclass SVM (HM-SVM)," in International Conference on Computer, Control, Informatics and its Applications (IC3INA), Bandung, Indonesia, 2015.

T. Faseela and M. Jayasree, "Spoof Face Recognition in Video Using KSVM," Procedia Technology, vol. 24, pp. 1285-1291, 2016.

A. Rikhtegar, M. Pooyan and M. T. Manzuri-Shalmani, "Genetic algorithm-optimised structure of convolutional neural network for face recognition applications," IET Computer Vision, vol. 10, no. 6, pp. 559-566, 2016.

C. Ding and D. Tao, "Trunk-Branch Ensemble Convolutional Neural Networks for Video-Based Face Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 1002-1014, 2018.

N. Sudha, A. R. Mohan and P. K. Meher, "A Self-Configurable Systolic Architecture for Face Recognition System Based on Principal Component Neural Network," IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 8, pp. 1071-1084, 2011.

G. Goswami, M. Vatsa and R. Singh, "Face Verification via Learned Representation on Feature-Rich Video Frames," IEEE Transactions on Information Forensics and Security, vol. 12, no. 7, pp. 1686-1698, 2017.

S. Damavandinejadmonfared, "Kernel Entropy Component Analysis using local mean-based k-nearest centroid neighbour (LMKNCN) as a classifier for face recognition in video surveillance camera systems," in IEEE 8th International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, 2012.

B. Ko, J.-H. Jung and J.-Y. Nam, "View-independent object detection using shared local features," Journal of Visual Languages & Computing, vol. 28, no. June, pp. 56-70, 2015.

Z. Cui, H. Chang, S. Shan, B. Ma and X. Chen, "Joint sparse representation for video-based face recognition," Neurocomputing, vol. 135, no. July, pp. 306-312, 2014.

C. L. Witham, "Automated face recognition of rhesus macaques," Journal of Neuroscience Methods, vol. 300, no. April, pp. 157-165, 2018.

J.-F. Connolly, E. Granger and R. Sabourin, "An adaptive classification system for video-based face recognition," Information Sciences, vol. 192, no. June, pp. 50-70, 2012.

S. M. K. Hasan and M. Ahmad, "A new approach of sign language recognition system for bilingual users," in International Conference on Electrical & Electronic Engineering (ICEEE), Rajshahi, Bangladesh, 2015.

G. Li and J. Tang, "A new K-NN query algorithm based on the symmetric virtual grid and dynamic circle," in International Conference on Artificial Intelligence and Education (ICAIE), Hangzhou, China, 2010.

N. García-Pedrajas, J. A. R. d. Castillo and G. Cerruela-García, "A Proposal for Local k Values for k -Nearest Neighbor Rule," IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 2, pp. 470-475, 2017.

S. Ougiaroglou and G. Evangelidis, "Efficient k-NN classification based on homogeneous clusters," Artificial Intelligence Review, vol. 42, p. 491–513, 2014.

F. Li, R. Zhang and F. You, "Fast pedestrian detection and dynamic tracking for intelligent vehicles within V2V cooperative environment," IET Image Processing, vol. 11, no. 10, pp. 833-840, 2017.

A. Mian, "Online learning from local features for video-based face recognition," Pattern Recognition, vol. 44, no. 5, pp. 1068-1075, 2011.

A. S. Kundu, O. Mazumder, P. K. Lenka and S. Bhaumik, "Hand Gesture Recognition Based Omnidirectional Wheelchair Control Using IMU and EMG Sensors," Journal of Intelligent & Robotic Systems, vol. 91, p. 529–541, 2018.

M.-C. Hu, M.-H. Chang, J.-L. Wu and L. Chi, "Robust Camera Calibration and Player Tracking in Broadcast Basketball Video," IEEE Transactions on Multimedia, vol. 13, no. 2, pp. 266-279, 2011.

W.-G. Chen, X. Wang and Y. Tian, "A two-stage algorithm for the early detection of zero-quantized discrete cosine transform coefficients in High Efficiency Video Coding," EURASIP Journal on Image and Video Processing, vol. 2017, p. 56, 2017.

I. Natgunanathan, Y. Xiang, G. Hua, G. Beliakov and J. Yearwood, "Patchwork-Based Multilayer Audio Watermarking," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 11, pp. 2176-2187, 2017.

L. Yuan-Yuan, C. He-Xin, Z. Yan and S. Hong-Yan, "Discrete cosine transform optimization in image compression based on genetic algorithm," in 8th International Congress on Image and Signal Processing (CISP), Shenyang, China, 2015.

M. Abdelrasoul, M. S. Sayed and V. Goulart, "Real-time unified architecture for forward/inverse discrete cosine transform in high efficiency video coding," IET Circuits, Devices & Systems, vol. 11, no. 4, pp. 381-387, 2017.