Details

SpeciesHuman
Species subgroup
Subgroup typenone
Sequence NameIGHV4-4*10
IUIS NameIGHV4-4*10
Alternative namesIGHV4-4*i03
Affirmation Level1
Full Sequence
Coding Sequence
FunctionalityORF
Inference TypeGenomic Only
MappedFalse
Paralogs
Paralog RepFalse

Observations in VDJbase

Inferred sequences in VDJbase that match this sequence:

VDJbase Allele NameSubjectsSequence Match
IGHV4-4*02_a106g 1

Un-rearranged Observations

Un-rearranged sequence observations that support this sequence:

AccessionTypeRepositoryStartEnd
OU596101InferredGenBank1295

CDR delineation

CDR1 Start76
CDR1 End102
CDR2 Start154
CDR2 End174
CDR3 Start289

Non-Core Regions

UTR 5' Start
UTR 5' End
L-PART1 Start
L-PART1 End
L_PART1
L-PART2 Start
L-PART2 End
L_PART2
v_rs_start
v_rs_end
V_HEPTAMER
V_NONAMER

Extension

3' ExtensionA
3' start320
3' end320
5' start
5' end

Additional Information

Sequence IDA00072
CuratorWilliam Lees
Curator addressBirkbeck College, University of London, Malet Street, London
Version4
Release Date2023-08-16
Release Notes

Further D sequences added with the support of Rodriguez et al. ‘Likely’ extensions added to three IGHV alleles. See Notes for details.

LocusIGH
Sequence TypeV
Gene Subgroup4
Gene Designation4
Allele Designation10

Acknowledgements

Individuals acknowledged as contributing to this sequence:

No Items

Notes

Notes are added by IARC reviewers.

IARC meeting 63, Nov 24th, 2020:
The meeting considered the inference of the variant IGHV4-4*02_a106g, in the VDJbase dataset of sample P1_I42_S1. The sequence was seen in 1.27% of all unmutated rearrangements, with 268 sequences including 234 perfect matches to the inferred allele. There was abundant variation in the CDR3 regions of the aligned sequences. IGHV4-4*07 was also present in the genotype, at a lower frequency (0.73% of all unmutated sequences, 157 sequences, 135 unmutated sequences). Haplotyping data was not available. Plots of the final 3’ nucleotides of the inference were also unavailable. In light of the low sequence counts and the lack of haplotyping, the inferred sequence was affirmed as a Level 0 sequence. The final 3’ nucleotides will be considered at a later date, at which time the affirmed sequence will be noted in the IARC minutes.

IARC meeting 84; Oct 25th, 2021:
IGHV4-4*02_A106G was inferred in subject S39 (P1_I42_S1; ERR2567216). This inference has previously been pre-assessed at IARC meeting 63 (https://www.antibodysociety.org/wordpress/wp-content/uploads/2020/12/Meeting-63-24_11_20-minutes.pdf). The inference was supported by many sequences (294) and unmutated sequences (253) a high allelic frequency (64%), a high overall frequency in the unmutated population (1.5%) and many unique CDR3s (243) in the unmutated sequence set. Haplotyping based on alleles of IGHJ6 was not possible. Haplotyping based on different variant sequences of IGHD3-10*01 (DOI: 10.3389/fimmu.2019.00987), one of which is not recorded in the IMGT database, was possible (but was not part of the OGRDB submission as the variant IGHD allele is not present in the database used for inference). Separate analysis following IgDiscover-based inference suggested complete separation of IGHV4-4*02_A106G (IGHV4-4*02_S2599) relative to the other allele of IGHV4-4. Overall, the genotype also carried IGHV4-4*07, IGHV4-59*01, IGHV4-59*08, and IGHV4-61*02_A234G among this set of similar genes. The upstream regions of all these alleles have been inferred in this subject in the past (DOI: 10.3389/fimmu.2021.730105) and the upstream region of IGHV4-4*02_A106G is different from those of the other alleles of this set of genes. IARC now affirms, based on the extensive validation, the sequence at level 1 up to and including base 319 in agreement with past practice. It is acknowledged that the allele most likely carries one additional base, typically A at base position 320. We recognise that alleles of IGHV4-4/IGHV4-59/IGHV4-61 may residue in gene locations different from that associated to the most similar allele in the IMGT database. Although there is nothing in the data, including haplotype assessment, that suggest that the allele does not reside in gene IGHV4-4, it must be recognized that IARC gene naming does not reflect a position on the precise gene location of the allele to a specific gene. The allele is given the name IGHV4-4*i03.

>IGHV4-4*i03
CAGGTGCAGCTGCAGGAGTCGGGCCCAGGACTGGTGAAGCCTTCGGGGACCCTGTCCCTCACCTGCGCTGTCTCTGGTGGCTCCATCAGCAGTGGTAACTGGTGGAGTTGGGTCCGCCAGCCCCCAGGGAAGGGGCTGGAGTGGATTGGGGAAATCTATCATAGTGGGAGCACCAACTACAACCCGTCCCTCAAGAGTCGAGTCACCATATCAGTAGACAAGTCCAAGAACCAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCCGTGTATTACTGTGCGAGAG.

Additional information for this sequence was imported into OGRDB via bulk update with the following notes:
Sequence annotation is based on Genbank sample OU596101
VDJbase example haplotype: P1_I42
One nucleotide truncation at 3’ end. The IARC states that its location is uncertain

Attachments

No Items

History

History logs the times and reasons for the publication of each version of this sequence.

Mats Ohlin
2021-11-01 22:51:02
Version 1 published

Submission published by IARC on November 1st, 2021

Mats Ohlin
2021-11-17 15:25:59
IMGT Name updated to IGHV4-4*10

IMGT Name updated.

William Lees
2023-07-10 11:24:41
Version 2 published

Bulk upload of sequences for the AIRR-C Human IG germline sets

William Lees
2023-07-24 14:13:43
Version 3 published

Remove unsupported extensions

William Lees
2023-08-16 13:47:31
Version 4 published

Further D sequences added with the support of Rodriguez et al. ‘Likely’ extensions added to three IGHV alleles. See Notes for details.

Changes from previous version

v3v4
Inference TypeUnrearranged and RearrangedGenomic Only
Extension?FalseTrue
3' ExtensionA
3' start320
3' end320
5' Extension

Versions

All published versions of this sequence.

Sequence NameIMGT NameAlternative namesInference TypeAffirmation LevelSpecies subgroupSubgroup typeGene startGene endUTR 5' StartUTR 5' EndL-PART1 StartL-PART1 EndL-PART2 StartL-PART2 EndCDR1 StartCDR1 EndCDR2 StartCDR2 EndCDR3 Startv_rs_startv_rs_endd_rs_3_prime_startd_rs_3_prime_endd_rs_5_prime_startd_rs_5_prime_endj_rs_startj_rs_endCodon FrameVersionDate
IGHV4-4*i03IGHV4-4*10Rearranged Only1none129576102154174289112021-11-01
IGHV4-4*10IGHV4-4*10IGHV4-4*i03Unrearranged and Rearranged1none129576102154174289132023-07-24
IGHV4-4*10IGHV4-4*10IGHV4-4*i03Genomic Only1none129576102154174289142023-08-16