S1K Custom - Search News

zhangbo2008/train_simplescaling-s1K-1.1

已经把参数改成cpu也能自动适配的了. 运行2.py即可.都不需要手动改. 第一步运行:data/fix_gpqa.py data/add_aime.py data/collect_data.py data ...

Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable

A two-stage sequential pipelin is considered. Reasoning training.. At this stage, we supervised fine-tune(SFT) the model on reasoning dataset (e.g., s1k) to produce the Large Reasoning Model (LRM).

Frontiers26d

Postsynaptic spiking determines anti-Hebbian LTD in visual cortex basket cells

Activity-dependent synaptic plasticity such as LTP underlies circuit development and memory encoding in the brain (Bliss and Collingridge, 1993; Katz and Shatz, 1996; Malenka and Bear, 2004; Hensch, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Trending now