RNA_feature¶
Purpose¶
Builds RNA-context features and annotations (ASE, hFDR, editing/imprinted/PON tags, sequence-context metrics).
Upstream¶
genotyping
Required inputs¶
germline_fileind_geno_filter_fileerror_count_file- resources:
gene_bed,dbsnp_vcf_file,imprinted_bed,editing_bed,PON_file,genome_fasta,reference_error_profile
Input interpretation¶
| Input key | Source | Required | Interpretation |
|---|---|---|---|
germline_file |
genotyping output |
Yes | Germline context used for RNA-level contrast and filtering logic. |
ind_geno_filter_file |
genotyping output |
Yes | Candidate loci table entering RNA annotation and tests. |
error_count_file |
umi_combine output |
Yes | Error-profile/count evidence used in RNA-context filtering features. |
| RNA/filter resources | resource_details.* |
Yes | Annotation resources for dbSNP/editing/imprinted/PON/reference-error-derived features. |
Parameters¶
From steps.RNA_feature:
min_count_for_germlinemin_prior_for_germlinedefault_range_of_genep_thresholdprevious_base
Parameter interpretation highlights¶
| Parameter | Interpretation |
|---|---|
min_count_for_germline, min_prior_for_germline |
Controls germline evidence sufficiency for RNA-context decisions. |
default_range_of_gene |
Window size for gene-context assignment when annotation ambiguity exists. |
p_threshold |
Main significance threshold for RNA-derived statistical tests. |
previous_base |
Sequence-context offset used in motif/context feature extraction. |
Outputs¶
RNA_feature:output_dir/RNA_feature/RNA_feature.txt- parquet mirror:
RNA_feature.parquet
Tuning notes¶
- This step merges several RNA-oriented annotation and statistical modules.
- Output fields are heavily used in final merged filtration rules.