DOI: 10.3390/app16136419 ISSN: 2076-3417

Design of an Iterative Cross-Modal and Context-Aware Deep Analytical Framework for Hate Speech and Fake Post Detection on Social Media Sets

Rakesh Bharati, Jyoti Bharti, Vasudev Dehalwar

There is an enormous rise in the amount of user-generated content on social media. That makes it easier for hateful and fake messages to spread, and threatens both societal stability and public trust in institutions. Most of current solutions have fundamental limitations due to modal limitations (i.e., each solution only uses one type of data at a time), lack of user context integration, poor synchronization across different types of data, and poor resilience to manipulation by adversaries. As a result, most solutions are subject to compound loss in terms of their ability to generalize well, classify correctly, or remain reliable when deployed in real-world environments. To address all of the above challenges, we propose a comprehensive and modular analytical framework consisting of five interconnected components that integrate contextual representation learning, multimodal semantic alignment, graph-based propagation modeling, adaptive inference, and consistency validation for hate speech and fake post detection. First is our Context-Driven Social Vector Extraction methodology, which provides enriched contextual embeddings by extracting and combining text-based metadata, image-based metadata, temporal metadata, and behavioral metadata. We use those embeddings in our second module, Multimodal Label Fusion via Mutual Co-Attention (CMF-MCA). Our CMF-MCA module incorporates two transformers with co-attention mechanisms that can mutually annotate text and images. In our third methodology, Semantic Propagation Graph for Hate and Fake Correlation (SPG-HFC), we implement a relational graph attention mechanism that captures both the influence of semantics and how communities propagate information about hate and fake posts. The fourth module, Adaptive Modality Routing via Reinforcement (AMR-R), routes based on the modality of the input and whether the input is simple enough to be classified using machine learning or complex enough to require deep learning. Finally, our Counterfactual Consistency Validation Engine (CCVE) is used after prediction to validate that the model’s predictions are consistent with the output data by creating counterfactuals and validating them. Therefore, in addition to improving the overall accuracy of hate speech and fake post detections, our proposed framework also improves its scalability and inference reliability. Additionally, because our framework allows multimodal classifications that include both context and behavior, it enables the scalable and trustworthy development of content moderation systems.

More from our Archive