It’s possible it reduces the probability of things like wrongly answered stack overflow questions from being used, so it might actually work a bit.
Kinda like how with image generation, you get vastly better results by adding a negative prompt such as “low quality, jpeg artifacts, extra fingers, bad hands” etc, because the dataset from boorus actually do include a bunch of those tags and using them steers the generation to do thing that don’t have features that match them.


Which would still not be perfect because “foo@bar”, “foo@[123.123.123.123]” and “💩 @[IPv6 :::1]” are all technically valid email addresses.
It looks like the only validation that doesn’t block something valid pretty much would start and end at “It has at least one @ symbol, and something on both sides”.