# # Probability matrix for all 5'ss sequences in the human genome. # Sequences were parsed from hg38 using GENCODE annotation. # Only sequences of the form NNNGYNNNN were considered. # # References # # Frankish A et al. (2019) GENCODE reference annotation # for the human and mouse genomes. Nucl Acids Res. 47(D1):D766–73. # pos A C G U 0 0.32465600595016736 0.3589800287457409 0.18931315100894872 0.12705081429514303 1 0.6285601332087014 0.1108453191011763 0.12065840477889567 0.13993614291122658 2 0.10533739392051032 0.0275965813340302 0.7941429715323356 0.07292305321312387 3 0.0 0.0 1.0 0.0 4 0.0 0.028156084682674495 0.0 0.9718439153173255 5 0.5726332505804429 0.03857557818137959 0.3478000127312139 0.04099115850696364 6 0.6664756984578479 0.08507466186900921 0.12876617785505848 0.11968346181808436 7 0.0921907403870959 0.06590413395917301 0.7578891647318572 0.0840159609218739 8 0.17763728771538367 0.15625889928604692 0.19366186565889057 0.4724419473396788