axolotl

Files

Wing Lian 743ba62bd5 Transformers 4.47.0 (#2138 )

* bump transformers and trl

* fix: update trainer.log signature

* fix trl trainer.log interfaces

* broken 🦥 with latest transformers

* skip parent, call grandparent - yeah, super janky

* update HF HUB env var and fix reward trainer log since it doesn't directly override log

* also bump accelerate

* patches for llama ga

* detab the code to check

* fix whitespace for patch check

* play nicely with CI tests since we patch everytime

* fix pop default in case it doesn't exist

* more tweaks to make patches nicer in CI

* fix detab for when there are possibly multiple patches

---------

Co-authored-by: NanoCode012 <nano@axolotl.ai>

2024-12-07 05:03:01 -05:00

Dockerfile

drop unnecessary BNB_CUDA_VERSION env var from docker as it just results in warnings (#2121 ) [skip ci]

2024-12-04 12:25:47 -05:00

Dockerfile-base

build causal_conv1d and mamba-ssm into the base image (#2113 )

2024-12-02 18:27:46 -05:00

Dockerfile-cloud

Transformers 4.47.0 (#2138 )

2024-12-07 05:03:01 -05:00

Dockerfile-cloud-no-tmux

Transformers 4.47.0 (#2138 )

2024-12-07 05:03:01 -05:00

Dockerfile-tests

drop unnecessary BNB_CUDA_VERSION env var from docker as it just results in warnings (#2121 ) [skip ci]

2024-12-04 12:25:47 -05:00