Add training callback to send predictions to WandB table (#521)

* WIP Add training callback to send predictions to WandB table

* WIP improve wandb table reporting callback

* WIP improve wandb table reporting callback (cont)

* Add VSCode launching for debugging

* Add tiny llama example

* WIP attempt to improve post-eval prediction generation for table

* WIP attempt to improve post-eval prediction generation for table - part 2

* WIP batch generation

* WIP attempt to handle sample_packing using position_ids for wandb prediction table

* WIP add code for debugging

* Fix sample_packing support for wandb prediction table

* Clean up code for PR review

* Add eval_table_size, eval_table_max_new_tokens configs & clean up code

* Clean up PR, delete VSCode config, add tiny-llama example

* Add eval_table_size, eval_table_max_new_tokens documentation. Fix linting/formatting

This commit is contained in:

Glavin Wiechert

2023-09-13 10:51:08 -03:00

committed by

GitHub

parent 2f586d18db

commit 5b67ea98a6

9 changed files with 278 additions and 3 deletions

									
										2

examples/llama-2/lora.yml
									
												View File
												
				@@ -56,6 +56,8 @@ flash_attention: true

				warmup_steps: 10

				eval_steps: 20

				eval_table_size: 5

				eval_table_max_new_tokens: 128

				save_steps:

				debug:

				deepspeed:

Add training callback to send predictions to WandB table (#521)

2 examples/llama-2/lora.yml Unescape Escape View File

2

examples/llama-2/lora.yml

View File