Can a 3B model deliver 30B class reasoning by fixing the training recipe instead of scaling parameters? Nanbeige LLM Lab at Boss Zhipin has released Nanbeige4-3B, a 3B parameter small language model ...
Please note that these are just the code examples accompanying the book, which we uploaded for your convenience; be aware that these notebooks may not be useful without the formulae and descriptive ...
这是2023年ZJU春夏学期课程地球电磁学的期末作业(written by Meng),感谢Yang Bo老师的悉心指导和学长的作业为我提供的思路 ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results