Data dtype used for AlphaGenome

Wildest · September 14, 2025, 12:32pm

Hi All, thanks for your wonderful work!

I noticed that in the AlphaGenome paper, your team uses brain floating point (bfloat16) to store track data. I am wondering which dtype the model itself uses during training: float32, bfloat16, or float16? Looking forward to your answer, thanks!

tward · September 15, 2025, 6:25pm

Hello @Wildest, welcome to the forum!

Yes, we use bfloat16 targets during training, typically normalized using the track’s mean of non-zero values.

Wildest · September 16, 2025, 8:13am

Thanks for your reply!

I also wonder whether you use mixed precision during training.

Suppose the original target is a 2 (two different experiments) by 3 (signal at 3 base pairs) matrix:
6, 3, 9 (in this track, the mean of non-zero values is 6)
2, 0, 2 (in this track, the mean of non-zero values is 2)
Then I need to normalize the target into
1, 0.5, 1.5
1, 0, 1
Is that right?

Guido_Novati · September 16, 2025, 3:01pm

Hi!

We do use JAX mixed precision. I’m not aware of whether it differs from the implementation in other frameworks. For example, in JAX, if x is a BF16 array and we call jax.numpy.sum(x), the accumulation is carried out in FP32, but the result is cast back to BF16. This affects, for example, the normalization statistics.
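
A minimal sketch of the summation behaviour described above, assuming a recent JAX release. The array `x` and the variable names are illustrative only. It shows that the default sum of a BF16 array comes back as BF16, and that passing `dtype=jnp.float32` keeps the result in FP32 when the extra precision is needed, for instance for normalization statistics.

```python
import jax.numpy as jnp

# Illustrative BF16 input: 256 ones (the sum is exactly representable in BF16).
x = jnp.ones((256,), dtype=jnp.bfloat16)

# Default reduction: the returned array has the input dtype (BF16).
# Per the description above, the accumulation itself runs in FP32 internally.
s_default = jnp.sum(x)

# Requesting an FP32 result explicitly avoids the final cast back to BF16.
s_fp32 = jnp.sum(x, dtype=jnp.float32)

print(s_default.dtype, s_fp32.dtype)
```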

We explicitly compute the losses in FP32 and, as is common practice, we cast the attention logits to FP32 before the softmax and cast the result back to BF16 after it.
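
The casting pattern above could look roughly like the following in JAX. This is only a sketch under assumptions: the function names `attention_weights` and `loss_fp32` are hypothetical, and the squared-error loss is a placeholder, since the thread does not say which loss is used.

```python
import jax
import jax.numpy as jnp

def attention_weights(logits_bf16):
    """Softmax over the last axis, computed in FP32 and returned as BF16."""
    logits_f32 = logits_bf16.astype(jnp.float32)   # upcast before the softmax
    weights_f32 = jax.nn.softmax(logits_f32, axis=-1)
    return weights_f32.astype(jnp.bfloat16)        # downcast after the softmax

def loss_fp32(pred_bf16, target_bf16):
    """Placeholder squared-error loss, evaluated entirely in FP32."""
    diff = pred_bf16.astype(jnp.float32) - target_bf16.astype(jnp.float32)
    return jnp.mean(jnp.square(diff))

# Illustrative inputs only.
logits = jnp.array([[1.0, 2.0, 3.0]], dtype=jnp.bfloat16)
weights = attention_weights(logits)

pred = jnp.array([1.0, 2.0], dtype=jnp.bfloat16)
target = jnp.array([1.0, 4.0], dtype=jnp.bfloat16)
loss = loss_fp32(pred, target)

print(weights.dtype, loss.dtype)
```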

Your math on the non-zero mean scaling is correct.
Note that the AlphaGenome API returns unscaled predictions.
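
For reference, the non-zero mean scaling from the 2 by 3 example in this thread can be sketched in JAX as below. The helper name `scale_by_nonzero_mean` is hypothetical, and leaving all-zero tracks unchanged is an assumption made here. In practice the mean would presumably be computed over a whole track rather than a 3 bp window.

```python
import jax.numpy as jnp

def scale_by_nonzero_mean(targets):
    """Divide each track (row) by the mean of its non-zero values.

    targets: array of shape [num_tracks, num_positions].
    Statistics are computed in FP32; the scaled targets are returned as BF16.
    """
    x = targets.astype(jnp.float32)
    nonzero = x != 0
    count = jnp.sum(nonzero, axis=-1, keepdims=True)
    total = jnp.sum(jnp.where(nonzero, x, 0.0), axis=-1, keepdims=True)
    mean = total / jnp.maximum(count, 1)
    # Assumption: tracks with no non-zero values are left unscaled.
    mean = jnp.where(count > 0, mean, 1.0)
    return (x / mean).astype(jnp.bfloat16)

targets = jnp.array([[6.0, 3.0, 9.0],
                     [2.0, 0.0, 2.0]])
scaled = scale_by_nonzero_mean(targets)
print(scaled)  # rows: [1, 0.5, 1.5] and [1, 0, 1]
```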
