what are the problems with making negative preference signals public that don’t apply to positive preference signals and why do they outweigh the loss from not being able to express them?
:cheshire_cat_smile:
x.com/hdevalence/s...