The amount of tacit knowledge in training models is literally insane. Is there some secret cabal trying to restrict this knowledge? Is Karpathy the only defector?