Predict gene-level expression given sequences

dpanc · November 17, 2025, 3:44pm

Hi,

I tested model.score_interval and compared its predictions with those from model.score_variant.
The correlation between the two outputs is high (~0.98), which is great, but the absolute values differ substantially for highly expressed genes.
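
For context, I realize a high correlation alone does not guarantee matching absolute values, since Pearson correlation is unchanged when one output is rescaled. Below is a minimal synthetic sketch of the pattern I am seeing. The numbers are made up rather than AlphaGenome output, and the 0.5 scale factor is purely an assumption for illustration, not a claim about what either scorer does internally.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for gene-level scores from one method (NOT real model output).
a = rng.lognormal(mean=2.0, sigma=1.5, size=1000)

# Hypothetical second method: same signal, different scale (assumed factor 0.5).
b = 0.5 * a

# Pearson correlation is invariant to rescaling, so it stays at 1 here.
r = float(np.corrcoef(a, b)[0, 1])

# Absolute differences grow with expression level.
abs_diff = np.abs(a - b)
top = a >= np.quantile(a, 0.9)  # the 10% most highly "expressed" genes

print("Pearson r:", r)
print("mean |a - b|, top 10%:", abs_diff[top].mean())
print("mean |a - b|, rest:   ", abs_diff[~top].mean())
```

In this toy case the gap is largest for the highest values even though the correlation is perfect, which looks similar to what I observe.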

Why could this difference occur, especially for genes with high predicted expression?
Also, could someone clarify what the width parameter in GeneMaskScorer actually means in practice?

I’ve read this thread, but the explanation there (“The width of the target interval to include in the aggregation”) is still not fully clear to me. Does it define the window size around the gene body used for aggregation, or something else?

Thanks in advance!
