Output data#
Neoantigens#
NeoFox returns the neoantigen candidates and their annotated features as output. Two output formats are supported: tabular format or JSON format. The user gets the neoantigen annotations in all formats. Despite different structures, both formats provide the same content with the exception of the metadata on the annotations which is only present in the JSON format.
For each neoantigen candidate (
mutatedXmer
), the best predicted epitope based on MHC prediction is annotated. Columns that refer to a predicted epitope consist of 3 components, each separated by “_”. The first part contains the tool name, the second part contains the selection method of the best epitope and the third part contains the name of the value. Example: “ToolName_SelectionMethod_ValueName” -> NetMHCpan_bestRank_peptide (literally: the peptide with the best predicted NetMHCpan rank).
If no unique selection of the best predicted epitope can be made due to ties, the lexicographically first epitope is returned. If there are nevertheless two or more alleles that have equally good predicted values for the peptide, the lexicographically first one is reported. These cases can be analysed using the –with-all-epitopes options.
The following table describes each of the annotations in the output:
TABLE 1
Column name |
Description |
Feature group/ Paper |
---|---|---|
dnaVariantAlleleFrequency |
the variant allele frequency calculated from the DNA |
|
mutatedXmer |
the long mutated amino acid sequence |
|
wildTypeXmer |
the long non-mutated amino acid sequence. This field shall be empty for alternative neoantigen classes |
|
patientIdentifier |
the patient identifier |
|
rnaExpression |
the RNA expression. If expression was imputed, this will will be |
expression |
imputedGeneExpression |
median gene expression in the TCGA cohort of the tumor entity provided in the patient file. |
expression |
rnaVariantAlleleFrequency |
the variant allele frequency calculated from the RNA |
|
gene |
the HGNC gene symbol |
|
Mutated_rnaExpression_fromRNA |
transcript expression normalized by the variant allele frequency in RNA of the mutation |
expression |
Mutated_rnaExpression_fromDNA |
transcript expression normalized by the variant allele frequency in DNA of the mutation |
|
Mutated_imputedGeneExpression_fromRNA |
imputeted gene expression normalized by the variant allele frequency in RNA of the mutation |
expression |
Mutated_imputedGeneExpression_fromDNA |
imputed gene expression normalized by the variant allele frequency in RNA of the mutation |
|
mutation_not_found_in_proteome |
indicates if mutated amino acid sequence was not found in the WT proteome by exact search |
Priority score |
NetMHCpan_bestRank_rank |
minimal MHC I binding rank score over all neoepitope candidates (8-11mers) and MHC I alleles |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_peptide |
neoepitope candidate sequence with minimal MHC I binding rank score |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_core |
The minimal 9 amino acid binding core directly in contact with the MHC. |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_Icore |
Interaction core. This is the sequence of the binding core including eventual insertions of deletions. |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_Of |
Starting position offset of the core in the peptide (0 based) |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_Gp |
Position of the deletion (0 based), if any, in the Icore compared to the core. |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_Gl |
Length of the deletion, if any, in the Icore compared to the core. |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_allele |
the MHC I allele related to |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_affinity |
minimal MHC I binding affinity over all neoepitope candidates (8-11mers) and MHC I alleles |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_peptide |
neoepitope candidate sequence with minimal MHC I binding affinity |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_core |
The minimal 9 amino acid binding core directly in contact with the MHC. |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_Icore |
Interaction core. This is the sequence of the binding core including eventual insertions of deletions. |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_Of |
Starting position offset of the core in the peptide (0 based) |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_Gp |
Position of the deletion (0 based), if any, in the Icore compared to the core. |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_Gl |
Length of the deletion, if any, in the Icore compared to the core. |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_allele |
the MHC I allele related to |
MHC I binding with netMHCpan |
NetMHCpan_bestRank9mer_rank |
minimal MHC I binding rank score over all neoepitope candidates (9mers only) and MHC I alleles |
MHC I binding with netMHCpan |
NetMHCpan_bestRank9mer_peptide |
neoepitope candidate sequence (9mer) with minimal MHC I binding rank score |
MHC I binding with netMHCpan |
NetMHCpan_bestRank9mer_allele |
the MHC I allele related to |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_affinity |
minimal MHC I binding affinity over all neoepitope candidates (9mers) and MHC I alleles |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_allele |
the MHC I allele related to |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_peptide |
neoepitope candidate sequence (9mer) with minimal MHC I binding affinity |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_affinityWT |
MHC I binding affinity of |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity_peptideWT |
WT epitope that corresponds to |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_rankWT |
MHC I binding rank score of |
MHC I binding with netMHCpan |
NetMHCpan_bestRank_peptideWT |
WT epitope that corresponds to |
MHC I binding with netMHCpan |
NetMHCpan_bestRank9mer_rankWT |
MHC I binding rank score of |
MHC I binding with netMHCpan |
NetMHCpan_bestRank9mer_peptideWT |
WT epitope that corresponds to |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_affinityWT |
MHC I binding affinity of |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_peptideWT |
WT epitope that corresponds to |
|
PHBR_I |
harmonic mean of minimal MHC I binding rank scores of all MHC I alleles of a patient |
PHBR-I |
NetMHCpan_bestAffinity9mer_positionMutation |
indicates position of the mutation in |
MHC I binding with netMHCpan |
NetMHCpan_bestAffinity9mer_anchorMutated |
mutation in |
anchor/non-anchor |
NetMHCIIpan_bestRank_rank |
minimal MHC II binding rank score over all neoepitope candidates (15mers) and all MHC II alleles |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_peptide |
neoepitope candidate sequence with minimal MHC II binding rank score |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_core |
Binding core register (9mer) |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_Of |
Starting position offset of the optimal binding core (starting from 0) |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_coreRel |
Reliability of the binding core, expressed as the fraction of networks in the ensemble selecting the optimal core |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_allele |
the MHC II isoform related to |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_affinity |
minimal MHC II binding affinity over all neoepitope candidates (15mers) and all MHC II alleles |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_peptide |
neoepitope candidate sequence with minimal MHC II binding affinity |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_core |
Binding core register (9mer) |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_Of |
Starting position offset of the optimal binding core (starting from 0) |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_coreRel |
Reliability of the binding core, expressed as the fraction of networks in the ensemble selecting the optimal core |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_allele |
the MHC II isoform related to |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_rankWT |
minimal MHC II binding rank of |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestRank_peptideWT |
WT epitope sequence (15mer) that corresponds to |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_affinityWT |
minimal MHC II binding rank of |
MHC II binding with netMHCIIpan |
NetMHCIIpan_bestAffinity_peptideWT |
WT epitope sequence (15mer) that corresponds to |
MHC II binding with netMHCIIpan |
PHBR_II |
harmonic mean of minimal MHC II binding rank scores of all MHC II alleles of a patient |
PHBR-II |
Amplitude_MHCI_bestAffinity9mer |
ratio of |
Recognition Potential |
Amplitude_MHCI_bestAffinity |
ratio of |
Generator rate |
Amplitude_MHCII_bestRank |
ratio of |
Generator rate |
Pathogensimiliarity_MHCI_bestAffinity9mer |
score representing the similarity of |
Recognition Potential |
Pathogensimiliarity_MHCII_bestAffinity |
score representing the similarity of |
Recognition Potential |
RecognitionPotential_MHCI_bestAffinity9mer |
product of |
Recognition Potential |
DAI_MHCI_bestAffinity |
difference of |
DAI |
Classically_defined_neopeptide_MHCI |
|
Generator rate |
Alternatively_defined_neopeptide_MHCI |
|
Generator rate |
Classically_defined_neopeptide_MHCII |
|
Generator rate |
Alternatively_defined_neopeptide_MHCII |
|
Generator rate |
GeneratorRate_CDN_MHCI |
number of neoepitope candidates with MHC I binding affinity < 50 nM per neoantigen canidate |
Generator rate |
GeneratorRate_ADN_MHCI |
number of neoepitope candidates with MHC I binding affinity < 5000 nM per neoantigen canidate 10x better affinity in comparison to corresponding WT peptide |
Generator rate |
GeneratorRate_MHCI |
sum of |
Generator rate |
GeneratorRate_CDN_MHCII |
number of neoepitope candidates with MHC II binding rank score < 1 per neoantigen canidate |
Generator rate |
GeneratorRate_ADN_MHCII |
number of neoepitope candidates with MHC II binding rank score < 4 per neoantigen candidate 4x better rank in comparison to corresponding WT peptide |
Generator rate |
GeneratorRate_MHCII |
sum of |
Generator rate |
ImprovedBinder_MHCI |
ratio of |
self-similarity |
Selfsimilarity_MHCI_conserved_binder |
score representing the similarity between |
self-similarity |
Selfsimilarity_MHCI |
score representing the similarity between |
self-similarity |
Selfsimilarity_MHCII |
score representing the similarity between |
self-similarity |
Number_of_mismatches_MCHI |
number of amino acids that do no match between |
Priority score |
Priority_score_fromDNA |
combinatorial score of several features such as MHC binding, transcription expression and VAF in DNA |
Priority score |
Priority_score_fromRNA |
combinatorial score of several features such as MHC binding, transcription expression and VAF in RNA |
Priority score |
Priority_score_imputed_fromDNA |
combinatorial score of several features such as MHC binding, imputed gene expression and VAF in DNA |
Priority score |
Priority_score_imputed_fromRNA |
combinatorial score of several features such as MHC binding, imputed gene expression and VAF in RNA |
Priority score |
IEDB_Immunogenicity_MHCI |
IEDB Immunogenicity score for |
IEDB Immunogenicity |
IEDB_Immunogenicity_MHCII |
IEDB Immunogenicity score for |
IEDB Immunogenicity |
MixMHCpred_bestScore_peptide |
MHC class I neoepitope candidate sequence with maximum MixMHCpred score over all neoepitope canidates (8-11mers) and MHC I alleles |
MixMHCpred |
MixMHCpred_bestScore_score |
maximum MixMHCpred score over all neoepitope canidates (8-11mers) and MHC I alleles |
MixMHCpred |
MixMHCpred_bestScore_rank |
rank that corresponds to |
MixMHCpred |
MixMHCpred_bestScore_allele |
the allele with maximum MixMHCpred score |
MixMHCpred |
MixMHC2pred_bestRank_peptide |
MHC class II neoepitope candidate sequence with minimal MixMHC2pred score over all neoepitope canidates (13-18mers) and MHC II alleles |
MixMHC2pred |
MixMHC2pred_bestRank_rank |
minimal MixMHC2pred score over all neoepitope canidates (13-18mers) and MHC II alleles |
MixMHC2pred |
MixMHC2pred_bestRank_allele |
the MHC II isoform with minimum MixMHC2pred rank score |
MixMHC2pred |
Dissimilarity_MHCI |
score reflecting the dissimilarity of |
dissimilarity |
Dissimilarity_MHCII |
score reflecting the dissimilarity of |
dissimilarity |
Vaxrank_bindingScore |
total binding score of vaxrank |
vaxrank |
Vaxrank_totalScore |
product of total binding score and transcription expression score. Originally, the root of the number of reads supporting the mutation are used in the original implementation. To simplify, the transcript expression normalised to VAF is used. |
vaxrank |
Vaxrank_totalScore_imputed |
product of total binding score and imputed gene expression score. Originally, the root of the number of reads supporting the mutation are used in the original implementation. To simplify, the imputed gene expression normalised to VAF is used. |
vaxrank |
PRIME_bestScore_allele |
best predicted MHC allele by PRIME model |
PRIME |
PRIME_bestScore_peptide |
best predicted neoepitope candidate by PRIME model |
PRIME |
PRIME_bestScore_rank |
output rank score of PRIME model |
PRIME |
PRIME_bestScore_score |
output score of PRIME model |
PRIME |
HexAlignmentScore_MHCI |
the alignment score by HEX for |
HEX |
HexAlignmentScore_MHCII |
the alignment score by HEX for |
HEX |
In addition, all logging output is appended to a log file with the suffix
“:raw-html-m2r:`<folder>`/:raw-html-m2r:`<prefix>`.log“, where the folder is set by --output-folder
and the
prefix can be set with --output-prefix
.
Tabular format#
An output table with the suffix “_neoantigen_candidates_annotated.tsv“ is created.
This table contains the neoantigen candidates information, the neoantigen annotations and external annotations if they were present provided in the input table.
This is a dummy example:
patientIdentifier |
gene |
mutatedXmer |
wildTypeXmer |
position |
dnaVariantAlleleFrequency |
rnaVariantAlleleFrequency |
rnaExpression |
imputedGeneExpression |
Alternatively_defined_neopeptide_MHCI |
Alternatively_defined_neopeptide_MHCII |
Amplitude_MHCII_bestRank |
Amplitude_MHCI_bestAffinity |
Amplitude_MHCI_bestAffinity9mer |
Classically_defined_neopeptide_MHCI |
Classically_defined_neopeptide_MHCII |
DAI_MHCI_bestAffinity |
Dissimilarity_MHCI |
Dissimilarity_MHCII |
GeneratorRate_ADN_MHCI |
GeneratorRate_ADN_MHCII |
GeneratorRate_CDN_MHCI |
GeneratorRate_CDN_MHCII |
GeneratorRate_MHCI |
GeneratorRate_MHCII |
HexAlignmentScore_MHCI |
HexAlignmentScore_MHCII |
IEDB_Immunogenicity_MHCI |
IEDB_Immunogenicity_MHCII |
Improved_Binder_MHCI |
MixMHC2pred_bestRank_allele |
MixMHC2pred_bestRank_peptide |
MixMHC2pred_bestRank_rank |
MixMHCpred_bestScore_allele |
MixMHCpred_bestScore_peptide |
MixMHCpred_bestScore_rank |
MixMHCpred_bestScore_score |
Mutated_imputedGeneExpression_fromDNA |
Mutated_imputedGeneExpression_fromRNA |
Mutated_rnaExpression_fromDNA |
Mutated_rnaExpression_fromRNA |
NetMHCIIpan_bestAffinity_Of |
NetMHCIIpan_bestAffinity_affinity |
NetMHCIIpan_bestAffinity_affinityWT |
NetMHCIIpan_bestAffinity_allele |
NetMHCIIpan_bestAffinity_core |
NetMHCIIpan_bestAffinity_coreRel |
NetMHCIIpan_bestAffinity_peptide |
NetMHCIIpan_bestAffinity_peptideWT |
NetMHCIIpan_bestRank_Of |
NetMHCIIpan_bestRank_allele |
NetMHCIIpan_bestRank_core |
NetMHCIIpan_bestRank_coreRel |
NetMHCIIpan_bestRank_peptide |
NetMHCIIpan_bestRank_peptideWT |
NetMHCIIpan_bestRank_rank |
NetMHCIIpan_bestRank_rankWT |
NetMHCpan_bestAffinity9mer_affinity |
NetMHCpan_bestAffinity9mer_affinityWT |
NetMHCpan_bestAffinity9mer_allele |
NetMHCpan_bestAffinity9mer_anchorMutated |
NetMHCpan_bestAffinity9mer_peptide |
NetMHCpan_bestAffinity9mer_peptideWT |
NetMHCpan_bestAffinity9mer_positionMutation |
NetMHCpan_bestAffinity_Gl |
NetMHCpan_bestAffinity_Gp |
NetMHCpan_bestAffinity_Icore |
NetMHCpan_bestAffinity_Of |
NetMHCpan_bestAffinity_affinity |
NetMHCpan_bestAffinity_affinityWT |
NetMHCpan_bestAffinity_allele |
NetMHCpan_bestAffinity_core |
NetMHCpan_bestAffinity_peptide |
NetMHCpan_bestAffinity_peptideWT |
NetMHCpan_bestRank9mer_allele |
NetMHCpan_bestRank9mer_peptide |
NetMHCpan_bestRank9mer_peptideWT |
NetMHCpan_bestRank9mer_rank |
NetMHCpan_bestRank9mer_rankWT |
NetMHCpan_bestRank_Gl |
NetMHCpan_bestRank_Gp |
NetMHCpan_bestRank_Icore |
NetMHCpan_bestRank_Of |
NetMHCpan_bestRank_allele |
NetMHCpan_bestRank_core |
NetMHCpan_bestRank_peptide |
NetMHCpan_bestRank_peptideWT |
NetMHCpan_bestRank_rank |
NetMHCpan_bestRank_rankWT |
Number_of_mismatches_MCHI |
PHBR_I |
PHBR_II |
PRIME_best_allele |
PRIME_best_peptide |
PRIME_best_rank |
PRIME_best_score |
Pathogensimiliarity_MHCII_bestAffinity |
Pathogensimiliarity_MHCI_bestAffinity |
Pathogensimiliarity_MHCI_bestAffinity9mer |
Priority_score_fromDNA |
Priority_score_fromRNA |
Priority_score_imputed_fromDNA |
Priority_score_imputed_fromRNA |
RecognitionPotential_MHCI_bestAffinity |
RecognitionPotential_MHCI_bestAffinity9mer |
Selfsimilarity_MHCI |
Selfsimilarity_MHCII |
Selfsimilarity_MHCI_conserved_binder |
Vaxrank_bindingScore |
Vaxrank_totalScore |
Vaxrank_totalScore_imputed |
mutation_not_found_in_proteome |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Ptx |
BRCA2 |
AAAAAAAAAAAAALAAAAAAAAAAAAA |
AAAAAAAAAAAAAFAAAAAAAAAAAAA |
14 |
0.857 |
0.294 |
0.51950689 |
0.53659963 |
0 |
0 |
0.33333 |
2.3262 |
0.56811 |
0 |
1 |
7165.9 |
1 |
3.00E-05 |
0 |
1 |
0 |
18 |
0 |
19 |
177 |
389 |
0.18182 |
0.36258 |
0 |
HLA-DQA101:02-DQB106:02 |
AAAAAAALAAAAAAA |
0.00438 |
HLA-B*07:02 |
AAALAAAAA |
0.18911 |
-0.2706 |
0.45987 |
0.15776 |
0.44522 |
0.15274 |
3 |
14.84 |
21.8 |
HLA-DQA101:02-DQB106:02 |
AAAAALAAA |
0.74 |
AAAAAAAALAAAAAA |
AAAAAAAAFAAAAAA |
3 |
HLA-DQA103:01-DQB106:02 |
LAAAAAAAA |
0.75 |
AAALAAAAAAAAAAA |
AAAFAAAAAAAAAAA |
0.03 |
0.01 |
1534.8 |
1180.8 |
HLA-C*03:04 |
0 |
AAALAAAAA |
AAAFAAAAA |
4 |
1 |
1 |
AALAAAAAAA |
0 |
1018.2 |
8184.1 |
HLA-A*02:01 |
ALAAAAAAA |
AALAAAAAAA |
AAFAAAAAAA |
HLA-B*07:02 |
AAALAAAAA |
AAAFAAAAA |
4.389 |
5.068 |
0 |
0 |
AAALAAAAA |
0 |
HLA-B*07:02 |
AAALAAAAA |
AAALAAAAA |
AAAFAAAAA |
4.389 |
5.068 |
1 |
8.5743 |
NA |
HLA-B*07:02 |
AAALAAAAA |
0.114 |
0.10131 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0.97634652 |
0.98126377 |
0.97634652 |
0.01301 |
0.00199 |
0.00205 |
1 |
Ptx |
BRCA2 |
AAAAAAAAAAAAARAAAAAAAAAAAAA |
AAAAAAAAAAAAAMAAAAAAAAAAAAA |
14 |
0.556 |
0.173 |
0.71575659 |
0.53659963 |
0 |
1 |
390 |
5.6409 |
5.6409 |
0 |
1 |
1498 |
1 |
0 |
0 |
18 |
0 |
27 |
0 |
45 |
132 |
409 |
0.19477 |
0.42378 |
1 |
HLA-DQA101:02-DQB106:02 |
AAAARAAAAAAAAAA |
0.00438 |
HLA-B*07:02 |
AAAAAARAA |
0.08828 |
0.04913 |
0.29835 |
0.09283 |
0.39796 |
0.12383 |
3 |
13.55 |
15.81 |
HLA-DQA101:02-DQB106:02 |
AAAARAAAA |
0.97 |
AAAAAAARAAAAAAA |
AAAAAAAMAAAAAAA |
3 |
HLA-DPA101:03-DPB106:01 |
RAAAAAAAA |
1 |
AAARAAAAAAAAAAA |
AAAMAAAAAAAAAAA |
0.01 |
3.9 |
199.38 |
1697.4 |
HLA-B*07:02 |
0 |
AAAAARAAA |
AAAAAMAAA |
6 |
0 |
0 |
AAAAARAAA |
0 |
199.38 |
1697.4 |
HLA-B*07:02 |
AAAAARAAA |
AAAAARAAA |
AAAAAMAAA |
HLA-B*07:02 |
AAAAARAAA |
AAAAAMAAA |
1.346 |
4.347 |
0 |
0 |
AAAAARAAA |
0 |
HLA-B*07:02 |
AAAAARAAA |
AAAAARAAA |
AAAAAMAAA |
1.346 |
4.347 |
1 |
4.6189 |
NA |
HLA-B*07:02 |
AAAAAARAA |
0.129 |
0.09563 |
0 |
0 |
0 |
0.32903 |
0.10238 |
0.26268 |
0.08173 |
0 |
0 |
0.9663076 |
0.97299187 |
NA |
0.80336 |
0.09948 |
0.07458 |
1 |
JSON format#
An output file with the suffix “_neoantigen_candidates_annotated.json“ is created.
This file contains neoantigen candidates information in JSON format.
The names within the models are described in TABLE 1.
This is a dummy example of a “_neoantigen_candidates.json“ file.
This file contains a list of neoantigen candidate models (for further information, please see here.
To simplify, only one full neoantigen candidate model is shown:
[{
"patient_identifier": "Ptx",
"gene": "BRCA2",
"position": [14],
"wild_type_xmer": "AAAAAAAAAAAAALAAAAAAAAAAAAA",
"mutated_xmer": "AAAAAAAAAAAAAFAAAAAAAAAAAAA",
"rna_expression": 0.5195068939999999,
"imputed_gene_expression": 0.5,
"dna_variant_allele_frequency": 0.294,
"rna_variant_allele_frequency": 0.857,
"neofox_annotations": [...],
"external_annotations": [...]
}, {
"patient_identifier": "Ptx",
"gene": "BRCA2",
"position": [14],
"wild_type_xmer": "AAAAAAAAAAAAARAAAAAAAAAAAAA",
"mutated_xmer": "AAAAAAAAAAAAAMAAAAAAAAAAAAA",
"rna_expression": 0.715756594,
"imputed_gene_expression": 0.5,
"dna_variant_allele_frequency": 0.17300000000000001,
"rna_variant_allele_frequency": 0.556,
"neofox_annotations": [ ... ],
"external_annotations": [ ... ]
}]
Notice that for simplicity purposes both fields neofox_annotations
and external_annotations
are not shown above.
For further information, please see here.
This is a dummy example of the field neofox_annotations
.
{
"annotations": [{
"name": "NetMHCpan_MHCI_rank_bestRank",
"value": "0.0592"
}, {
"name": "NetMHCpan_MHCI_rank_peptide",
"value": "AAAAAAAAF"
}],
"annotator": "Neofox",
"annotator_version": "1.0.0",
"timestamp": "20201211115212061465"
}
And this is a dummy example of the field external_annotations
:
[
{
"name": "external_annotation_1",
"value": "0.857"
}, {
"name": "external_annotation_2",
"value": "that"
}
]
The metadata on the annotations will look as follows:
{
annotator: "NeoFox",
annotator_version: "0.5.4",
timestamp: "20211104154037935536",
resources: [
{
name: "netMHCpan",
version: "4.1"
},
{
name: "netMHCIIpan",
version: "4.0"
},
{
name: "mixMHCpred",
version: "2.1"
},
{
name: "mixMHC2pred",
version: "1.2"
},
{
name: "IEDB",
url: "http://www.iedb.org/downloader.php?file_name=doc/tcell_full_v3.zip",
hash: "d225ab671ef375400d387354a5f450ff",
download_timestamp: "20211103221051"
},
{
name: "Human Ensembl proteome",
version: "2021_03",
url: "https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/reference_proteomes/Eukaryota/UP000005640/UP000005640_9606.fasta.gz",
hash: "a41303fd38380ca0321cf8a3d9beb4bc",
download_timestamp: "20211103221051"
},
{
name: "Mouse Ensembl proteome",
version: "2021_03",
url: "https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/reference_proteomes/Eukaryota/UP000000589/UP000000589_10090.fasta.gz",
hash: "27a5de8c1eca42eebaf56400945cf7cb",
download_timestamp: "20211103221051"
},
{
name: "IMGT/HLA database",
version: "3.46.0",
url: "https://raw.githubusercontent.com/ANHIG/IMGTHLA/Latest/Allelelist.txt",
hash: "5a7618819498b88d0790bf4d58975d13",
download_timestamp: "20211103221051"
}
]
}
Neoepitopes#
NeoFox returns the neoepitopes candidates and their annotated features as output when using
the flag --with-all-neoepitopes
or when explicitly annotating neoepitopes with the command neofox-epitope
.
Two output formats are supported: tabular format or JSON format.
The user gets the neoepitope annotations in all formats.
Despite different structures, both formats provide the same content with the exception of the metadata on
the annotations which is only present in the JSON format.
The tabular format is split into two tables:
a first one for the MHC-I neoepitope candidates and a second one for
the MHC-II neoepitope candidates.
Tabular format#
Two output files with the suffix “_mhcI_epitope_candidates_annotated.tsv” and “”_mhcII_epitope_candidates_annotated.tsv”” are created.
The following table describes each of the annotations in the output. MHC-I or MHC-II specific features will be only available in the respective table:
TABLE 2
JSON format#
Only when using the command neofox-epitope
an output file with the suffix “_neoepitope_candidates_annotated.json“ is created.
This file contains neoepitope candidates information in JSON format.
The names within the models are described in TABLE 2.
This is a dummy example of a “_neoantigen_candidates.json“ file.
This file contains a list of neoantigen candidate models (for further information, please see here.
To simplify, only one full neoantigen candidate model is shown:
[{
"patient_identifier": "Ptx",
"gene": "BRCA2",
"mutated_peptide": "AAAALAAAA",
"wild_type_peptide": "AAAAFAAAA",
"allele_mhc_i": "HLA-A*01:01",
"rna_expression": 0.519,
"imputed_gene_expression": 0.5,
"dna_variant_allele_frequency": 0.294,
"rna_variant_allele_frequency": 0.857,
"affinity_mutated": 2.567,
"rank_mutated": 0.898,
"affinity_wild_type": 1.023,
"rank_wild_type": 2.398,
"neofox_annotations": [...],
"external_annotations": [...]
}, {
"patient_identifier": "Ptx",
"gene": "BRCA2",
"mutated_peptide": "AAAAAAAAAAAAARAAAAAAAAAAAAA",
"wild_type_peptide": "AAAAAAAAAAAAAMAAAAAAAAAAAAA",
"isoform_mhc_i_i": "DRB1*01:01",
"rna_expression": 0.715,
"imputed_gene_expression": 0.5,
"dna_variant_allele_frequency": 0.173,
"rna_variant_allele_frequency": 0.556,
"affinity_mutated": 2.567,
"rank_mutated": 0.898,
"affinity_wild_type": 1.023,
"rank_wild_type": 2.398,
"neofox_annotations": [ ... ],
"external_annotations": [ ... ]
}]
Notice that for simplicity purposes both fields neofox_annotations
and external_annotations
are not shown above.
For further information, please see here.
For an example of the NeoFox annotations section, see the previous section.