Hasty Briefsbeta

Bilingual

An NSFW Filter for Marginalia Search

5 hours ago

#Neural Network
#NSFW Filter
#Search Engine

The author developed an NSFW filter for Marginalia Search using a single hidden layer neural network, as other methods like Fasttext produced too many false positives.
Training data was generated by using an LLM (ollama/qwen) to classify search results as SAFE or NSFW, avoiding manual labeling but resulting in skewed samples due to NSFW search terms.
The neural network uses handpicked features (e.g., 'cum', 'balls') and disambiguating terms (e.g., 'laude', 'golf') to reduce false positives, though balancing accuracy and false positives remains challenging.
Implementation involves forward propagation and backpropagation with gradient descent, using ReLU and sigmoid activation functions, and binary cross-entropy loss.
The filter is currently available via the API, with future UI plans, and shows ~90% accuracy in evaluations, but practical false positive rates are higher due to low NSFW base rates.