haha our model likes to talk about goblins. no, of course we don't know why, we don't know why the model does anything - yes, we are trying to make a superintelligent machine god. maybe it will like goblins too, we have no way of knowing what it will like. we hope it will like humans
Reddit discussion of unexplained model behavior (goblin preference) and speculative commentary on AI alignment risks.