Research


ARF-Net: a multi-modal aesthetic attention-based fusion (Visit Research)

Can a deep fusion method be established for biometric identification?

  • Over the last decade, Online Social Media platforms have witnessed a dramatic expansion due to the substantial reliance of individuals on these communication channels. These platforms are widely utilized to convey emotions, share opinions, and express preferences through various means such as artworks, multimedia contents, and blogs. Researchers are exploring these individual-specific traits for biometric identification. Aesthetic biometric systems utilize users’ unique preferences across various subjective forms such as images, music, and textual contents. This study introduces a novel multi-modal aesthetic system, with a primary contribution to the development of an attention-based fusion method for person identification. The proposed identification system leverages a deep pre-trained model for high-level feature extraction from visual and auditory modalities. The paper introduces a novel fusion architecture named attention-based residual fusion network (ARF-Net) to incorporate two heterogeneous aesthetic feature vectors. The proposed model yielded a 99.38% identification accuracy on the Aesthetic Image Audio 32 (AIA32) dataset and 98.02% identification accuracy on Aesthetic Image Audio 52 (AIA52) dataset, outperforming other aesthetic biometric systems. The proposed architecture stands out for its efficiency, showcasing a lightweight architecture with minimal parameters, ensuring optimal performance in different modalities.

Fig 1. Architecture of the proposed multi-modal aesthetic system.

Unimodal Aesthetic Biometric System (Visit Research)

Can a deep learning-based aesthetic system can identify users with the highest precision and lower inference time

  • A novel three-stage framework based on deep learning and classical machine learning is proposed for identifying individuals from their audio aesthetic. To extract audio features, mel-spectrograms are used instead of generic spectrograms. As a result, the extracted feature set becomes more discriminating for person identification. A novel hybrid meta-heuristic algorithm Cuckoo Search based Whale Optimization Algorithm (CSWOA) is proposed that retrieves the most optimal feature subset from the high-level features.

Fig 4. Proposed Hybrid meta-heuristic feature selection algorithm (CSWOA) along with transfer learning-based feature extraction method.

Online Judging Platform Utilizing Dynamic Plagiarism Detection Facilities (Visit Research)

What approaches can be implemented to develop an efficient plagiarism detection system for programming problems in an online judging platform, considering the challenges faced by teachers in identifying source code plagiarism and the prevalence of plagiarism among students?

  • Designed an online judging framework designed specifically for programming labs, addressing the limitations of traditional online judging platforms. The proposed system incorporates automatic scoring of codes with efficient detection of plagiarized content, utilizing program fingerprints generated by the Rabin-Karp Algorithm. By selecting fingerprints through winnowing among k-gram hash values, the system improves time efficiency, correctness, and feature availability compared to existing online judging platforms. The evaluation of the system with large datasets and comparison of runtime with the widely used MOSS plagiarism detection technique further demonstrates its superiority.


Fig 5. Flowchart of the proposed Online Judging System.

Human Activity Recognition (Visit Research)

Can the integration of transfer learning with a two-stream neural network architecture, utilizing 3D dense optical flow, result in a significantly improved Human Activity Recognition (HAR) system?

  • In this work a novel human activity recognition technique is developed that combines action recognition and 3D dense optical flow from video sequences, leveraging the efficiency of optical flow as a feature for accurate recognition. and 3D dense optical flow from video sequences, leveraging the efficiency of optical flow as a feature for accurate recognition. The proposed method utilizes transfer learning with a two-stream neural network architecture, incorporating the pre-trained ResNet152 architecture, to extract fine-grained features from the dense optical flow. The proposed approach was evaluated on the UCF-101 dataset, highlighting its potential for advancing human activity recognition in various computer vision applications.


Fig 4. Architecture of the proposed Human Activity Recognition(HAR) system.

Cervical Cancer Prediction (Visit Research)

Can machine learning algorithms along with Adaptive Synthetic Sampling(ADASYN) and Linear Discriminant Analysis (LDA) identify patterns or interactions among risk factors that may enhance the prediction of cervical cancer, and how do these findings align with existing knowledge in the field?

  • In this study, a significant contributions were made in developing a cervical cancer prediction system. Missing value imputation led to higher precision, highlighting the importance of addressing missing data in cervical cancer datasets. Additionally, Linear Discriminant Analysis (LDA) is implemented for dimensionality reduction and the Adaptive Synthetic Sampling approach (ADASYN) is adopted to balance the dataset, resulting in improved outcomes. Moreover, the Isolation Forest algorithm is used, to identify and removed outliers in the cervical cancer datasets, enhancing the quality of the data. Lastly, an effective classifier model using various machine learning techniques is developed and evaluated their performence using different evaluation approaches, resulting in higher performance efficacy for cervical cancer prediction.


Fig 5. Architecture of the proposed Cervical Cancer Prediction system.