* feat: add arg to enable dft in liger * feat: add tests use_token_scaling * fix: test * fix: move check to args