Hate speech detection faces a critical challenge: content moderation systems must adapt to platform-specific social norms while maintaining generalization across communities. This issue is important because current NLP models often underperform when applied across different platforms, leading to inconsistent moderation. We address this by proposing a domain-aware transformer model that explicitly encodes platform information through specialized tokens. Our solution enables the same model to learn both platform-specific nuances and cross-platform patterns. Evaluating on Twitter and Reddit data, we demonstrate that this approach outperforms conventional generalist models while nearly matching platform-specific specialists. The findings highlight an efficient pathway for improving content moderation systems that operate across diverse online communities.
-
Notifications
You must be signed in to change notification settings - Fork 0
Will1618sd/deep-learning
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published