Question about model architecture
#40
by
sh0416
- opened
Hello,
I'm just wondering that the architecture is different from starcoder.
Starcoder uses GPTBigCode, while this use custom architecture.
If it differs, could you elaborate details?
Thanks.
AFAIK Santa Coder was an early experiment. Please use the starcoder series models.