Genebase Bionet Builder
Advertisement

The Genebase Learning Center

Archive for the 'mtDNA and its role in Ancestry' Category

Jun 21 2008

The mtDNA and its role in Ancestry: Part XVII (James “Earthquake McGoon” McGovern)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern  <<– you are here

In the last three blogs, we have been discussing the topic of how anthropologists have used mtDNA to solve historical questions.  In particular, we have been focusing on the analysis of historical figures who may have belonged to Haplogroup H, one of the largest European Haplogroups.  We will wrap up our Haplogroup H series with this case about a World War II fighter ace and CIA pilot who died in combat in Vietnam. 

Case #4:  James “Earthquake McGoon” McGovern

Who was he?

James McGovern was a World War II fighter ace who died in a plane crash in Laos on May 6, 1954 when the Civil Air Transport plane that he was flying to provide ammunition to French troops was hit by groundfire.  McGovern and Buford (an American co-pilot) and two French Corporal Chiefs were killed instantly, making McGovern and Buford the first two Americans to die in combat in Vietnam. 

The source of his nickname:

At six feet and 260 pounds, McGovern was considered large for a fighter pilot, prompting his nickname “Earthquake McGoon” after the fierce and primitively charismatic (hulking hillbilly) wrestler from the popular Li’l Abner comic strip.

What were the anthropologists trying to find out? ie why was his DNA tested?

Between 1997 and 2002, multiple investigations of the site of incident were done by different teams (including joint US-Lao teams); however, only small fragments of aircraft wreckage were found, but no human burial sites were located.  In 2002, while investigating an unrelated crash near the Ban Sot area, the JTF-FA team discovered an old C-119 (the type of plan that McGovern was flying) propeller.  A few months later, human remains of a single individual were discovered from an unmarked grave, leading to speculation that the grave may have belonged to James McGovern.

The main purpose of the project was to determine whether the remains belonged to James McGovern.

Who’s Who in researching this project:

  • Irwin et al from Armed Forces DNA Identification Laboratory, Armed Forces Institute of Pathology, Rockville, USA.
  • Holland et al from Joint POW/MIA Accounting Command - Central Identification Laboratory, Hawaii, USA

Top peer reviewed research publications for genetic identification of James McGovern:

This table lists the most significant papers for the James McGovern case and the DNA tests that were performed to identify his remains in peer reviewed journals.  Links are provided to access the original articles.

These papers provide the extent of what is known today about the DNA of James McGovern and provides answers to the question of whether the remains discovered in the grave belong to the James McGovern.

Name of Scientific Article Scientific Journal
DNA Identification of “Earthquake McGoon” 50 Years PostmortemIrwin et alArmed Forces DNA Identification Laboratory, Armed Forces Institute of Pathology, Rockville, USA Click here to read abstracthttp://www.blackwell-synergy.com/doi/abs/10.1111/j.1556-4029.2007.00506.x J Forensic Sci.  2007 Sep; 52(5):  1115-8.

As further information becomes available, this table will be updated.

Solving the case: Collecting DNA samples

The first step in solving the case involved collecting DNA samples from the remains and locating living relatives of James McGovern for DNA testing. 

  1. The remains:  DNA was extracted from the left femur obtained in the excavation (putative McGovern).
  2. The relatives:  DNA samples were collected from living relatives of James McGovern including: 1 maternal cousin (McGovern’s mother’s sister’s son), 1 sister-in-law, 4 nieces (daugthers of sister-in-law), 1 nephew (sone of sister-in-law)

The relationship of each individual tested in relation to James McGovern are shown in the pedigree below:

The DNA testing process:

Mulitple DNA testing types were performed on the remains and the relatives.  To follow are the DNA test types used in this study:

  1. mtDNA HVR1 and mtDNA HVR2 sequencing - to confirm maternal lineage
  2. mtDNA SNP testing - to supplement the HVR1 and HVR2 results by improving the discrimination power of the test
  3. Autosomal STR testing - to confirm relationships
  4. Y-DNA STR testing - to confirm paternal lineage

The goal of the various test types is to firmly establish the identity of the remains by looking for proof of relationship along the maternal and paternal lines to known descendents of McGovern’s family.

Results of the DNA tests:

Individual Tested mtDNA HVR1 and HVR2 Results mtDNA Coding Region Results Y-DNA STR Results
Remains (putative James McGovern) Same as Maternal Cousin of McGovern, different from Buford maternal reference, 16519 +ve 3010 +ve      4793 –ve     10211 –ve    5004 –ve    7028 –ve    7202 –ve    12852 –ve    4580 –ve    477 –ve    14470 –ve      Same as Paternal Nephew of McGovern
Maternal Cousin of McGovern (female) Same as Remains, different from Buford maternal reference, 16519 +ve 3010 +ve      4793 –ve     10211 –ve    5004 –ve    7028 –ve    7202 –ve    12852 –ve    4580 –ve    477 –ve    14470 –ve     Not tested (females do not have Y-DNA)
Paternal Nephew of McGovern (male) Not tested Not tested Same as Remains
Buford’s maternal reference Different from Remains, different from Maternal Cousin of McGovern Not tested Not tested

Summary of Results:

The results of the DNA tests were able to confirm the following:

  1. The mtDNA profile of the Remains was identical to the mtDNA profile of McGovern’s Maternal Cousin.
  2. The Y-DNA STR haplotype of the Remains was identical to the Y-DNA STR haplotype of McGvoern’s Paternal Nephew.

Based on statistical analysis of data available, the likelihood ratio that the Remains belong to a male individual who is related to McGovern’s living family members is 96,900, thus successfully establishing that the remains are indeed those of McGovern who died in 1954.

According to Dr. Thomas Holland (director of JPAC’s Central Identification Laboratory), McGovern was the second person ever identified by their laboratory through forensic analysis of Y-DNA.  Most cases involving old and highly degraded samples tend to rely on mtDNA analysis since mtDNA is far more stable than Y-DNA when examining extremely old or degraded samples. 

The Aftermath:

A military funeral was held in New Jersey on October 28, 2005 to honor this well-known US military pilot. 

Haplogroup Analysis:

Although the original researchers did not intend to determine the haplogroup of James McGovern, amateur genetic genealogists have stated that McGovern belonged to Haplogroup H.  Let’s take a look at how they have arrived at this conclusion:

Step 1:  To begin, click here to download the mtDNA Haplogroup Map.  You will need to use this map to follow along in this discussion.

Step 2:  Identify the presence and absence of McGovern’s mtDNA markers on the map.  Starting from the CRS, move outwards and cross off all markers that McGovern does not have, circle all of the markers that he does have and put a question mark next to the markers that have not been tested:

Based on the coding region results, McGovern is negative for the marker 7028, thus excluding him from all haplogroups except Haplogroup H.  The absence of markers in his HVR1 (except for 16519), and the absence of coding region marker 4580 further suggests that McGovern belongs to Haplogroup H. 

Can we determine his subclade?

If McGovern is truly a member of Haplogroup H, let’s see if we can find out which Subclade of H he belongs to.  To proceed, click here to download and print the H Subclade map, as you will need it to follow along with this discussion.

Again, circle the markers that McGovern does have and cross out the markers that McGovern definitely does not have.  In this case, we will leave all unknown markers untouched.

McGovern is positive for the coding region marker 3010, which is a defining marker for Subclade H1.  However, he is negative for 477, indicating that he is not from sublineage H1c, and he is negative for 12858, indicating that he is not likely H1c2.  Other coding results indicate that McGovern is negative for 5004, indicating that he is not likely H4; he is negative for 4793, indicating that he is not likley H7; and he is negative for 14470, indicating that he is not likely H10. 

Based on the known DNA markers results for McGovern obtained to date, it is safe to conclude that he is very likely a member of Haplogroup H.  If he is indeed a member of Haplogroup H, his most likely a descendent of Subclade is H1.  However, in order to confirm his Subclade, a Subclade test would be required to examine markers 1638, 6776, 13101, 4310, 8448, 3936, 14872, 11377, 6352, 1039, 3915, 13708, 14869 and 8494 to exclude the possibility of other H Subclades, including H2, H3, H8, H9, H11, H12, H13, H14, H15, H16, H17, H18, H19, and H21.  Other markers that would be useful to examine are 6365, 8271, 3796, 9066 and 9150 to determine which further sublineage of H1 he belongs to. 

Jun 04 2008

The mtDNA and its role in Ancestry: Part XVI (Empress Feodorovna)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna  <<– you are here
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In the last few blogs, we have been discussing the topic of how anthropologists have used mtDNA to solve historical questions.  In this blog, we will continue our analysis of historical figures who may have belonged to haplogroup H.

Case #3:  Empress Alexandra Feodorovna of Russia

Who was she?

Empress Alexandra Feodorovna of Russia, the granddaughter of Queen Victoria was born on June 6, 1872 as Princess Alix of Hess and by Rhine in Darmstadt (Germany).  She was the sixth child of Grand Duke Louis IV of Hesse and by Rhine and Princess Alice of the United Kingdom.  She was the princess consort of Nicholas II, and the last Tsaritsa of Russia. 

What were the anthropologists trying to find out? ie why was her DNA tested?

At the end of the February Revolution of 1917, Tsar Nicholas II was forced to abdicate.  The Romanov family were imprisoned in Ipatiev House at Ekaterinburg in the Urals of Central Russia in 1918 with three servants and the family doctor.  On the night of July 16, 1918, the entire Romanov family was shot and killed by the Bolshevik firing squad. 

According to legend, two bodies were burnt and the others were buried in a roadside pit.  To hinder identification, sulphuric acid was said to have been thrown into the open grave and a truck was driven back and forth over the grave. 

The case remained a mystery until July 1991, when nine human skeletons were discovered in a shallow pit around 20 miles from Ekaterinburg, Russia.  Historians speculated that the remains belonged to the Romanov family.  To help solve the mystery, DNA testing was performed to identify the remains.

Who’s who in researching the history of the Romanov family:

  • Gill et al. from the Central Research and Support Establishment, Forensic Science Service, Aldermaston, Reading, Berkshire, UK
  • Ivanov et al. from Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
  • Hagelberg et al. from Department of Biological Anthropology, University of Cambridge, Cambridge, UK
  • Wadhams et al. from Armed Forces DNA Identification Laboratory, Office of the Armed Forces medical Examiner, Armed Forces Institute of Pathology, Rockville, Maryland, USA
  • Hofreiter et al. from Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
  • Loreille et al. from Armed Forces DNA Identification Laboratory, Office of the Armed Forces medical Examiner, Armed Forces Institute of Pathology, Rockville, Maryland, USA
  • Knight et al. from Department of Anthropological Sciences, Stanford University, Stanford, USA
  • Zhivotovsky et al. from Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
  • Kass et al. from Department of Biology, Eastern Michigan University, Ypsilanti, USA
  • White et al. from Bioscience Division, Los Alsmos national Laboratory, Los Alamos, USA
  • Mountain et al. from Department of Genetics, Standford University, Stanford, USA

Top peer reviewed research publications for Tsarina Alexandra and the Romanov family:

This table lists the most significant papers for the Romanov family and the DNA tests that were performed to identify their remains in peer reviewed journals.  Links are provided to access the original articles.

These papers provide the extent of what is known today about the DNA type of Tsarina Alexandra and provides answers to the question of whether the bodies discovered in the grave belong to the Romanov Family.

Name of Scientific Article Scientific Journal
Identification of the remains of the Romanov family by DNA analysis.  Gill et al, Forensic Science Service, Berkshire, UK.  Click here to read abstract Nat Genet. 1994 Feb; 6(2):130-5.
DNA forensics. Buried, recovered, lost again? The Romanovs may never rest.Stone et alThis publication is published in Science magazine, a subscriptions only magazine.  Click here to go to Science magazine. Science. 2004 Feb 6; 303(5659):753.
Molecular, forensic and haplotypic inconsistencies regarding the identity of the Ekaterinburg remainsKnight et al, Stanford University, Stanford, USA.  Click here to read abstract Ann Hum Biol.  2004 Mar-Apr; 31(2):129-38.
Ongoing controversy over Romanov remains.Hofreiter et al, Gill et al, Knight et alThis publication is published in Science magazine, a subscriptions only magazine.  Click here to go to Science magazine. Science. 2004 Oct 15;306(5695):407-8, 408-9, 409-10

As more articles become available, they will be listed here.

Solving the case:  Collecting DNA samples from the Ekaterinburg remains

The first step in solving this case is to test the DNA of the Ekateringburg remains.  After the DNA type of the remains is known, it can be compared to the DNA type of known relatives in the royal family.  Since Tsarina Alexandra is the maternal granddaughter of Queen Victoria (Queen Victoria is her mother’s mother), Tsarina Alexandra must have inherited her mtDNA from Queen Victoria.  A living descendent of Queen Victoria who carries Queen Victoria’s mtDNA type is Prince Philip.  The following diagram shows in purple how the mtDNA is passed down from Queen Victoria to her descendents and which one of her descendents carry her mtDNA:

Individuals who inherited the mtDNA type of Queen Victoria are highlighted in purple.

The descendent tree for Queen Victoria indicates that Tsarina Alexandra should have the same mtDNA profile as Prince Philip as well as other living and deceased descendents from the same line.  By collecting the DNA sample from known family members, scientists can find out the expected mtDNA type for Tsarina Alexandra and her children and use it to identify the remains.

Who was tested to solve this case?

  1. A blood sample was collected from Prince Philip, a living descendant of Queen Victoria (maternal grandmother of Tsarina Alexandra).
  2. Bone fragments from each of the nine skeletal remains found in Ekaterinburg.

Three Types of DNA tests were performed to confirm the identity of the Ekateringburg remains:

  1. Gender determination:  To identify the gender of each of the nine skeletal remains.
  2. Autosomal STR testing:  To determine whether the nine skeletons are related to each other, ie in the same family.
  3. Mitochondrial HVR1 and HVR2 sequencing:  To examine whether the skeletal remains are members of the Romanov family by comparing the mtDNA profile of the remains to the mtDNA of known royal family members.

Step 1:  Autosomal STR Testing:  To determine whether the 9 skeletons are related to each other

Autosomal STR testing (the same technology used for paternity testing) was used to determine whether the nine skeletons are from the same family. 

Skeleton #

Autosomal DNA Markers Relationship
VWA TH01 F13A01 FESFPS ACTBP2
3 15, 16 8, 10 5, 7 12, 13 11, 32 Child
4 15, 16 7, 10 7, 7 12, 12 11, 32 Parent
5 15, 16 7, 8 5, 7 12, 13 11, 36 Child
6 15, 16 8, 10 3, 7 12, 13 32, 36 Child
7 15, 16 8, 8 3, 5 12, 13 32, 36 Parent

The results of the autosomal test confirmed the following:

  1. Skeletons 3, 4, 5, 6, and 7 belonged to the same family.
  2. Skeletons 3, 4, 5, 6 and 7 are likely a family group consisting of three children and both of their parents. 
  3. Skeletons 1, 2, 8, 9 are not related to skeletons 3, 4, 5, 6, and 7.

This information is consistent with the skeletons belonging to members of the Romanov family. 

Step 2:  Determining the gender of the skeletons

The next set of tests sets out to determine the gender of the skeletons.

Skeleton # Sample Type Sample Quality Gender Relationship
3 Bone Good Female Daughter
4 Bone Good Male Father
5 Bone Good Female Daughter
6 Bone Good Female Daughter
7 Bone Good Female Mother

The results of the gender determination tests showed that four of the skeletons are male and one was female.  The combined results of the autosomal test and gender test leads to the speculation that skeleton #4 is the father, skeleton #7 is the mother, and skeletons 3, 5, and 6 are their daughters.

Step 3:  What if the skeletons belonged to the Romanov family?

If the remains truly belong to the Romanov family, then:

  • Skeleton 4 “father” should belong to Tsar Nicholas II
  • Skeleton 7 “mother” should belong to Tsarina Alexandra
  • Skeletons 3, 5, and 6 are three of their 4 daughters
Skeleton # Relationship Speculated Identity (if the Romanovs)
3 Daughter Daughter #1
4 Father Tsar Nicholas II
5 Daughter Daughter #2
6 Daughter Daughter #3
7 Mother Tsarina Alexandra

The next step is to confirm the speculated identities by comparing each skeleton to living royal family members. 

Step 4:  Solving the mystery.  Confirming the identity of Skeleton #7, Tsarina Alexandra and her daughters, Skeletons 3, 5, and 6

Now that the preliminary studies indicate the possible identities of each of the skeletons, the next step is to confirm the identity by comparing to the DNA of living royal family members.  Tsarina Alexandra is the grandaughter of Queen Victoria, and her mtDNA should be identical to the mtDNA of Prince Philip, a living descendent of Queen Victoria. 

The next step is to test the mtDNA of Prince Philip and compare it to the mtDNA of the skeletons.

Prince Philip’s mtDNA:

Name HVR1 Region HVR2 Region Coding Region Publication
Prince Philip 16111,16357 263,315.1 Not tested Gill et al 1994

Prince Philip is a direct descendent of Queen Victoria and he carries the mtDNA type of Queen Victoria. 

The mtDNA of the Skeletons:

Name HVR1 Region HVR2 Region Coding Region Publication
Skeleton #3:Putative daughter 16111,16357 263,315.1 Not tested Gill et al 1994
Skeleton #4:Putative Tsar Nicholas II 16126,16169Y,16294,16296 73,263,315.1  Not tested Gill et al 1994
Skeleton #5:Putative daughter 16111,16357 263,315.1 Not tested Gill et al 1994
Skeleton #6:Putative daughter 16111,16357 263,315.1 Not tested Gill et al 1994
Skeleton #7:Putative Tsarina Alexandra 16111,16357 263,315.1 Not tested Gill et al 1994

 Conclusion:  The results of the mtDNA test confirm the following:

  1. The putative Tsarina Alexandra’s mtDNA (skeleton 7) is a perfect match to the mtDNA profile of Prince Philip, indicating that it is the actual skeleton of Tsarina Alexandra.
  2. Tsarina Alexandra’s mtDNA is a perfect match to the mtDNA profile of Skeletons 3, 5, and 6, her putative daughters.
  3. Skeletons 3, 5, and 6 are also identical to Prince Philip’s mtDNA, indicating that they are the actual skeletons of Tsarina Alexandra’s daughters.
  4. The putative Tsar Nicholas II’s mtDNA (skeleton 4) does not match the mtDNA of Prince Philip.  This is expected since Tsar Nicholas II is not a direct descendent of Queen Victoria.  We will discuss the set of analysis that was conducted to confirm the identity of Tsar Nicholas II in a separate blog.

In summary, the DNA studies confirm that five out of the nine skeletons discovered truly belonged to the Romanov family, namely Tsar Nicholas II, Tsarina Alexandra, and 3 of their 4 daughters.  The skeleton of one of their daughters and their only son, Alexei, remain missing.

The Aftermath:  Doubts and Controversies

Since the findings, the skeletons were reburied with honors in the imperial-era capital of St. Peterburg.  The two remaining children remain missing.

The scientific findings and results have since been challenged by Knight et al, suggesting irregularities and sample handling deficiencies.

Recent Findings from the summer of 2007

In the summer of 2007, using metal detectors and metal rods as probes, two burned partial skeletons were discovered at a bonfire site near Ekaterinburg, 900 miles east of Moscow.  15 intact bone fragments and more than 40 pieces of charred bone were discovered.

The site appeared to match the site described in Yurovsky’s (the Bolshevfik officer in charge of the Romanovs’ captivity) memoirs.  According to his memoirs, the bodies of nine victims were doused with sulphuric acid and buried along a road.  Alexei’s body and one of his sister’s bodies was burned and left in a pit nearby

Preliminary analysis of the bones showed that the remains were from a boy roughly between the ages of 10 to 13 years old at the time of death, the putative Alexei and a young woman roughly between the ages of 18 to 23 years old at the time of death, attributed to one of the Romanov daughters, likely Maria or  Anastasia.

Genetic investigations were performed in the Sverdlovsk Regional Forensic Medicine Bureau, a lab in Moscow, and a US laboratory, the University of Massachusettes Medical School. 

On April 30, 2008, the groups claimed that DNA testing (mainly mitochondrial DNA analysis) proved that the remains belonged to Tsarevich Alexei and one of his sisters (likely Maria) were released in several major newspapers and news sources (including New York Times, BBC, and MSNBC).

Article Year
Bones turn up in hunt for last czar’s son 2007
Probe reopened into death of last Russian czar 2007
DNA tests may solve mystery of last czar’s heirs 2008
Report: DNA proves link to last czar’s kids 2008
Russia dig finds ‘tsar’s family’ 2007
Lost Romanov bones ‘identified’ 2007
Tsar’s lost children identified 2008
Experts May Have Found Remains of Czar’s Children 2008
Amateurs Unravel Russia’s Last Royal Mystery 2007
DNA Tests Confirm the Deaths of the Last Missing Romanovs 2007

What is the haplogroup of Queen Victoria’s descendents?

Now that the mtDNA type of Queen Victoria’s line is known, amateur genealogists from around the world have used the data to compare to their own families to see if they may have links to royalty.

Although the original researchers never attempted to use the data from the study to determine the haplogroup of this royal family line, amateur genealogists have tried t use the data from the study to determine the mtDNA haplogroup of Queen Victoria’s line, concluding that Queen Victoria and her descendent Tsarina Alexandra belonged to Haplogroup H.  Let’s take a look at the raw data and see how accurate they are.

Step 1:  Click here to download and print the mtDNA Haplogroup map so that you can follow along with the discussion.

Step 2:  Identify the presence and absence of HVR1 markers on the map.

Tsarina Alexandra only had two HVR1 markers, namely 16111 and 16357.  On the map, all HVR1 markers are in blue.  Starting from the CRS, move outwards and cross off all of the HVR1 markers that Tsarina Alexandra does not have, circle the ones that she does have and put a question mark next to the ones that are unknown:

The results of the HVR1 test helps to eliminate the haplogroups that Tsarina Alexandra definitely does not belong to, and shows that Tsarina Alexandra must belong to either Haplogroup R, Pre-HV, HV, H, or the CRS branch of H.

Step 3:  Identify the presence or absence of HVR2 markers on the map:

Tsarina Alexandra has 2 markers in her HVR2 region, namely 263 and 315.1.  On the map, all HVR2 markers are red.  Starting from the CRS, move outwards and cross off all of the HVR2 markers that Tsarina Alexandra does not have and circle all of the HVR2 markers that she does have:

The results of the HVR2 test helps to eliminate Haplogroup R and shows that Tsarina Alexandra must belong to either Haplogroup Pre-HV, HV, or H.  Tsarina Alexandra has marker 263 which brings her away from CRS, indicating that she is unlikely to belong to CRS, but the presence of 263 cannot eliminate her from CRS as such a conclusion woudl require more data from the coding region.

In conclusion, the results of the scientific studies show that Tsarina Alexandra can belong to any one of the following Haplogroups:

  • H
  • HV
  • Pre-HV

These results are similar to those of Marie Antoinette, even though the exact mtDNA mutations that they carry are slightly different.  To confirm whether Tsarina Alexandra is really a member of Haplogroup H, one would need to test the coding region of her mtDNA to confirm that she does not have markers 7028 and 14766.  However, no coding region data was ever available for Tsarina Alexandra or for any of her living or deceased relatives, so there is no way to confirm that she is a member of Haplogroup H.  Since the original researchers were aiming to identify her remains rather than to determine her haplogroup, the coding region was never examined and in this particular case, coding region SNP test is required to confirm her haplogroup.

In conclusion, it is pre-mature to conclude that Tsarina Alexandra is a member of Haplogroup H, but the data from the studies to date do serve to narrow down her possible haplogroups to H, HV, and Pre-HV.  Further studies woud need to focus on markers 7028 and 14766 in the coding region which would provide a determination of which Haplogroup Tsarina Alexandra actually belongs to, and if she does in fact belong to Haplogroup H, further H subclade testing using the H subclade test panel should confirm which branch of H she falls into.

May 22 2008

The mtDNA and its role in Ancestry: Part XV (Luke the Evangelist)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist  <<– you are here
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In the last blog, we began discussing how anthropologists have used mtDNA to solve historial questions.  In this blog, we will continue our analysis of historical figures who may have belonged to haplogroup H.

Case #2:  Luke the Evangelist

Who was he?

Luke the Evangelist is the patron saint of physicians and surgeons and his feast day is October 18.  He is said to be the author of the third Gospel and the Acts of the Apostles.  Luke was born in the city of Antioch (an ancient city in the Roman province of Syria, present day Antakya, Turkey).  He died around A.D. 150 at the age of 84 years in Thebes, the capital of Boeotia, Greece. 

What were the anthropologists trying to find out? ie why was his DNA tested?

Although the body of Saint Luke was initially buried in Thebes, it was later transfered from Greece to Constantinople (capital of the Byzantine Empire, present day Istanbul, Turkey) during the second year of reign of emperor Constantius around A.D. 338.  The body was transferred a second time around A.D. 1177, this time from Constantinople to its current resting place in Pauda, Italy. 

Figure 1:  The transfer of the body of Saint Luke

Historians have long questioned the identity of the body attributed to Saint Luke that now lies in Padua, Italy.  In particular, they wonder whether the body may have been replaced in Greece or Turkey and are curious about whether the body was of Syrian origin (the location of St. Luke’s birthplace) or of Turkish or Greek origin (the locations where the body may have been replaced).  Using modern DNA technology, anthropologists set out to answer this question by obtaining DNA samples from the body attributed to Saint Luke and comparing it to the mtDNA type of population samples from Syria, Turkey and Greece to see if they can determine the most likely geographic origin of the body.

Who’s who in researching the history of Saint Luke:

The main research groups studying this area are:

  • Vernesi et al from the Department of Biology, University of Ferrara, Ferrara, Italy
  • Caramelli et al from Institute of Anthropology, University of Florence, Florence, Italy
  • Simoni et al from Genetics and Biometry Laboratory, Department of Anthropology, University of Geneva, Geneva, Switzerland
  • Malaspina et al from Department of Biology, University of Rome, Rome, Italy
  • Novellotto et al, Department of Biology, University of Calabria, Arcavacata di Rende, Italy
  • Marin et al, Institute of Pathological Anatomy, Univerity of Padua, Padua, Italy

Table 1:  Top peer reviewed research publications for Saint Luke

This table lists the most significant papers for Luke the Evangelist’s DNA profile in peer reviewed journals, with links to access the original publications. 

These papers provide the extent of what is known today about the DNA type of Luke the Evangelist and provides answers to the question of whether the body attributed to Luke the Evangelist belongs to a Greek or a Syrian.

Name of Scientific Article Scientific Journal
Genetic characterization of the body attributed to the evangelist Luke Vernesi et al University of Ferrara, Ferrara, Italy Click here to view and download a copy of the original publication Proc Natl Acad Sci U S A. 2001 Nov 6;98(23):13460-3. Epub 2001 Oct 16.

As new papers become available, they will be added to this list.

Investigating the case:  Collecting a DNA sample from the body attributed to Saint Luke and collecting DNA samples from individuals from Greek and Syria:

The first step in solving the mystery is to decode the mtDNA of the body attributed to Saint Luke.  Once the mtDNA code is known, it can be compared to the mtDNA code from individuals from different parts of Europe to gain clues into the ancestral origin of the body. 

First step:  Collecting DNA from the body attributed to Saint Luke in Italy:

What was collected?  A whole canine tooth, a tooth root and some bone fragments were collected from a sarcophaagus containing the body traditionally attributed to St. Luke.  Only the whole canine tooth contained sufficient DNA to perform the DNA test and subsequent studies focussed on DNA obtained from the tooth.

What were the results? 

Name Sample type Sample quality HVR1 Region HVR2 Region Coding Region Publication
Evangelist Luke Tooth sufficient sample quality 16235, 16291 Not tested Only tested for SNP marker 7028.  Results indicate 7028 positive Vernesi et al, 2001

The researchers sequenced the HVR1 region of the mtDNA extracted from the tooth.  No analysis was performed on the HVR2 region.  The researchers also tested a single SNP marker, namely marker 7028 from the coding region of the mtDNA.

The next step:  Gathering mtDNA from individuals from Syria, Greece, and Turkey

Next, the researchers obtained the mtDNA HVR1 sequence for individuals from Syria, Greece and Turkey for comparison with the results of the mtDNA from the body attributed to Saint Luke.

Location collected # of individuals tested
Syria (Arabic-speaking northern Syrians) 49
Greece 48 (30 from Attica and 18 from Crete)
Turkey 96

What was concluded from the results?

Upon comparing the HVR1 region of the mtDNA obtained from the body attributed to Saint Luke against the HVR1 region of the mtDNA from individuals currently living in Syria, Greece and Turkey, the researchers found the closest match to individuals from Syria.  The researchers claimed that statistical interpretations of the data suggest that the match to individuals from Syria was three time greater than the match to individuals from Greece, and the also detected a slightly closer matching frequency to people from Syria versus people from Turkey.  The results of this study indicate that the body is most likely of Syrian origin, the original birthplace of Saint Luke, thus helping to dispel theories that the body was replaced which it was transfered to Greece or Turkey.

Further studies comparing more regions of the mtDNA, including the HVR2 region and coding region as well as studies comparing the DNA from the body to a larger group of people from Syria, Greece, and Turkey would increase the power of this study.

The aftermath:  What has happended since this study was completed?

With the mtDNA HVR1 region of Luke the Evangelist known, amateur genealogists from around the world have used the data to compare to their own familites to see if they have any ties to Luke’s maternal family line.

Although no in depth haplogroup analysis was performed by the original scientists, amateur genealogists have tried to use the data from the study to determine the mtDNA haplogroup of Luke the Evangelist, stating that he belongs to haplogroup H.  Let’s take an in depth look at the raw data and see how accurate they are.

Step 1:  Click here to download and print the mtDNA Haplogroup map so that you can follow along with the discussion.

Step 2:  Identify the presence and absence of Saint Luke’s HVR1 markers on the map.

Saint Luke has two markers in his HVR1 region, namely 16235 and 16291.  On the map, all HVR1 markers are blue.  Starting from the CRS, move outwards and cross off all of the HVR1 markers that Saint Luke does not have, circle all of the markers that he does have, and put a question mark next to the markers that have not been tested:

The results of the HVR1 test helps to eliminate the haplogroups that Saint Luke definitely does not belong to, and shows that Saint Luke must belong to either haplogroup R, Pre-HV, HV, or H.  However, the results of the SNP test for coding region marker 7028 indicates that Saint Luke is positive for the 7028 marker, which brings him away from haplogroup H, towards haplogroups HV, Pre-HV, or R. 

Region tested Marker Result Conclusion
HVR1 Absence of 16189 not likely haplogroup B
HVR1 Absence of 16304 not likely haplogroup F
HVR1 Absence of 16126 not likely haplogroup J and T
HVR1 Absence of 16223 Likely descendant of R branch (includes B, F, J, K, P, T, U, Pre-HV, HV, H, Pre-V, and V)
HVR1 Absence of 16224 and 16311 not likely haplogroup K
HVR1 Absence of 16249 not likely subclade U1
HVR1 Absence of 16051 and 16129 not likely subclade U2
HVR1 Absence of 16343 not likely subclade U3
HVR1 Absence of 16356 not likely subclade U4
HVR1 Absence of 16270 not likely subclade U5
HVR1 Absence of 16172 and 16219 not likely subclade U6
HVR1 Absence of 16318 not likely subclade U7
HVR1 Absence of 16298 not likely haplogroup Pre-V and V
Coding region 7028 Positive for 7028 marker Less likely haplogroup H

Based on the results of Saint Luke’s mtDNA test, it is impossible, even possibly erroneous, to conclude that Saint Luke belongs to haplogroup H since there is not sufficient information from the HVR1 test results to reach that conclusion. 

The results of scientific studies to date show that Saint Luke can belong to any one of the following Haplogroups:

  • HV
  • Pre-HV
  • R

The presence of marker 7028 brings Luke away from haplogroup H, but further studies would be required in order to exclude H.

In conclusion, it is premature or even erroneous to conclude that Saint Luke belongs to haplogroup H, and data so far suggests that Saint Luke more likely belongs to HV, Pre-HV, or R.  Further studies would need to focus on markers 11719 and 14766 in the coding region, and 73 and 263 in the HVR2 region, which would provide a conclusive determination of which haplogroup Saint Luke actually belongs to.

May 17 2008

The mtDNA and its role in Ancestry: Part XIV (Descendents of Maria-Theresa)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa  <<– you are here
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

We have now had a chance to talk about mtDNA, what it is, what it tells us, and how mtDNA testing works.  Now, lets take a look at a fun topic and see how anthropologists have used mtDNA to help solve historical questions.  Since we have been focusing on mtDNA haplogroup H in the last few blogs, we will continue on that note and talk about some historical figures who may have belonged to haplogroup H.

Case #1:  The descendents of Maria-Theresa, Holy Roman Empress:  Using mtDNA to track the case of Louis XVII, son of Marie Antoinette

Marie-Antoinette (1755-1793), was executed in 1793 in Paris.  After her execution, her son and daughter remained imprisoned in the Temple of Paris.  According to official records, her son, Louis-Charles died of tuberculosis in the Temple on June 8, 1795.  Thoughout history, historians have wondered whether the boy who died in the Temple was truly Louis-Charles, or whether Louis-Charles had escaped and survived.  To add to this speculation, several individuals later claimed to be the son of Marie Antoinette.  The most notable was Wilhelm Naundorff, who died in 1845 and was buried under the name of Louis-Charles Duc de Normandie, ‘Louis XVII’, and his descendents were permitted to use the name ‘de Bourbon’, the name of the French royal family.

This case remained a mystery for historians for over a century, until modern DNA testing allowed historians to finally put this case to rest.  The studies conducted focused on testing the mtDNA of the remains of Wilhelm Naundorff and forensic samples from various members of Marie Antoinette’s family, descendents of Marie Antoinette’s mother, Maria-Theresa, Holy Roman Empress.

Who’s who in researching the history of Marie Antoinette’s family:

The main research groups studying this area are:

  • Jehaes et al from Center for Human Genetics, University of Leuven, Belgium
  • Peneau et al from Laboratoire de Genetique Moleculaire, CHRU, Nantes, France
  • Petrie et al from Petrus Campussingel, Groningen, The Netherlands
  • Boiry et al from Faculte Libre des Sciences de la Communication, Levallois-Perret, France

Table 1:  Top peer reviewed research publications for Marie Antoinette

This table lists the most significant papers for Marie Antoinette’s DNA profile in peer reviewed journals, with links to access the original publications.  These papers provide the extent of what is known today about the DNA type of  Maria-Theresa’s descendents and provides answers to the question of the true fate of Marie Antoinette’s son, Louis XVII.

Name of Scientific Article Scientific Journal
Mitochondrial DNA analysis on remains of a putative son of Louis XVI, King of France and Marie-Antoinette.Jahaes et alUniversity of Leuven, BelgiumClick here to view and download a copy of the original publication Eur J Hum Genet. 1998 Jul-Aug;6(4):383-95.
Mitochondrial DNA analysis of the putative heart of Louis XVII, son of Louis XVI and Marie-Antoinette.Jehaes et alUniversity of Leuven, BelgiumClick here to view and download a copy of the original publication Eur J Hum Genet. 2001 Mar;9(3):185-90.

Solving the case:  Collecting the DNA sample from royal family members:

The first step in solving this mystery is to find DNA clues for the royal family.  Since Louis-Charles is the biological son of Marie Antoinette, the true biological son of Marie Antoinette must have inherited his mtDNA from Marie Antoinette.  Marie Antoinette is a maternal descendent of Empress Maria-Theresa.  The following diagram shows in purple how the mtDNA is passed down from Maria-Theresa to her descendents and which one of her descendents carry her mtDNA:

The next step:

The descendent tree of Maria-Theresa indicates that the true Louis XVII should have the same mtDNA as Marie Antoinette as well as living descendents from the same line.  By collecting the DNA sample from living family members and collecting forensic samples from deceased members, scientists can find out the expected mtDNA type for the true Louis XVII.

Who was tested to solve this case?

A DNA sample was collected from the following descendents of Maria-Theresia:

  1. A DNA sample was collected from a hair sample from Johanna-Gabriela (Marie Antoinette’s sister) - hair samples were removed from a rosary with medalions containing hair from Johanna-Gabriela.
  2. A DNA sample was collected from a hair sample from Maria-Joseph (Marie Antoinette’s sister) - hair samples were removed from a rosary with medalions containing hair from Maria-Joseph.
  3. A DNA sample from Marie Antoinette was collected from two sources.  The first source was medallions kept in a private collection in Cannes.  The second was hair that was taken from a document containing a lock of hair fixed with a silk thread belonging to Marie-Antoinette.
  4. Queen Anne of Romania’s blood sample (a living descendent of Maria-Theresia). 
  5. Andre de Bourbon Parme’s hair sample (a living descendent of Maria-Theresia).

A DNA sample was collected from the following specimens:

  1. A piece of heart tissue from the boy who died in the Temple of Paris on June 8, 1795 - the heart was removed during autopsy in 1795 and preserved in a crystal urn at the Basilique Saint-Denis.
  2. A bone sample obtained from the coffin of Carl Wilhelm Naundorff, who claimed to be Louis XVII, son of Marie Antoinette - hair samples and the right humerus, both removed from skeletal remains when Naundorff’s coffin was opened in 1950 for a different study.

Table 2:  The results of the mtDNA test:

In this study, the researchers tested the HVR1 region and HVR2 region of the mtDNA but did not test the Coding region.  Since many of the forensic samples were extremely old, the DNA was of extremely poor quality.  The quality of the DNA for each individual is listed in the table below together with the results.

Name Relationship Sample type Sample quality HVR1 Region HVR2 Region Coding Region Publication
Johanna-Gabriela Marie Antoinette’s sister Hair Poor sample quality 16519 263, 315.1 Not tested Jahaes et al 1998
Johanna-Gabriela Marie Antoinette’s sister Hair Poor sample quality 16519 152, 194, 263, 315.1 Not tested Jahaes et al 2001
Maria-Joseph Marie-Antoinette’s sister Hair Poor sample quality 16519 263, 315.1 Not tested Jahaes et al 1998
Marie Antoinette - Hair Poor sample quality 16519 263, 315.1 Not tested Jahaes et al 1998
Marie Antoinette - Hair Poor sample quality 16519 152, 263, 315.1 Not tested Jahaes et al 2001
Queen Anne of Romania Living descendent of Maria-Theresa Blood Good sample quality 16519 152, 194, 263, 315.1 Not tested Jahaes et al 1998
Andre de Bourbon Parme Living descendent of Maria-Theresa Fresh hair sample Good sample quality incomplete 152, 194, 263, 315.1 Not tested Jahaes et al 1998
Putative heart of Louis XVII From putative son of Marie Antoinette, Louis XVII Heart tissue Good sample quality 16519 152, 194, 263, 315.1 Not tested Jahaes et al 2001
Carl Wilhelm Naundorff Claims to be Marie Antoinette’s son, Louis XVII Bone sample Good sample quality 16260 263, 315.1 Not tested Jahaes et al 1998

The results for some of the ancient samples yielded partial and incomplete data.  As a result, the final mtDNA for Maria Theresa must also include the examination of living descendents for further confirmation. 

What was concluded from the results?

The ancient hair samples from Marie Antoinette and her two sisters were of extremely poor quality, and even though the samples were repeatedly tested several times by the researchers, only a partial profile could be obtained.  To supplement the findings from the ancient samples, a fresh blood sample and hair sample was obtained from living descendents of Maria Theresa, including a blood sample from Queen Anne of Romania and a hair sample from her brother, Andre de Bourbon Parme.  The final results of the study indicated the following mtDNA type for Maria Theresa and all of her true biological maternal line descendents:

HVR1 HVR2 Coding Region
16519 152, 194, 263, 315.1 Not tested

The significance of this profile is that it is the key to all maternal line descendents of Maria Theresa.  All descendents of this line, including descendents living today, must carry this mtDNA profile.

Solving the mystery

Once the mtDNA type for descendents of Maria Theresa was obtained, the next task was to compare the mtDNA profile to Carl Wilhem Naundorff:

mtDNA HVR1 HVR2 Coding Region
mtDNA type of true biological descendents of Maria Theresa’s maternal line 16519 152, 194, 263, 315.1 Not tested
mtDNA type of Carl Wilhelm Naundorff 16260 263, 315.1 Not tested

Conclusion:  The mtDNA type of Carl Wilhelm Naundorff was different from the mtDNA of true biological descendents of Maria Theresa’s maternal line, proving that Carl Wilhelm Naundorff was an imposter and not Louis XVII.

The next task was to compare the mtDNA type for the descendents of Maria Theresa to the mtDNA profile obtained from the heart of the boy who died in the Temple of Paris to see if the boy who died was truly Louis XVII.

DNA HVR1 HVR2 Coding Region
DNA type of true biological descendents of Maria Theresa’s maternal line 16519 152, 194, 263, 315.1 Not tested
DNA type of the heart from the boy who died in the Temple of Paris in 1795. 16519 152, 194, 263, 315.1 Not tested

Conclusion:  The mtDNA type of the heart from the boy who died in the Temple in 1795 was a perfect match to the mtDNA of tue biological descendents of Maria Theresa’s maternal line, indicating that the boy who died in 1795 was the true Louis XVII.

The aftermath:  What has happened since this study was completed?

Now that the mtDNA type of the descendents of Maria Theresa’s line is known, amateur genealogists from around the world have used the data to compare to their own families to see if they may have links to royalty. 

Although the original researchers, Jahaes et al, never attempted to use the data from the study to determine the haplogroup of this royal family line, amateur genealogists have tried to use the data from the study to determine the mtDNA haplogroup of Marie Antoinette, the most famous descendent of Maria Theresa, concluding that Marie Antoinette belongs to Haplogroup H.  Let’s take a look at the raw data and see how accurate they are.

Step 1:  Click here to download and print the mtDNA Haplogroup map so that you can follow along with the discussion.

Step 2:  Identify the presence and absence of HVR1 markers on the map

Marie Antoinette only has one HVR1 marker, namely 16519.  On the map, all HVR1 markers are in blue.  Starting from the CRS, move outwards and cross off all of the HVR1 markers that Marie Antoinette does not have:

The results of the HVR1 test helps to eliminate the haplogroups that Marie Antoinette definitely does not belong to, and shows that Marie Antoinette must belong to either Haplogroup, R, or Pre-HV, or HV, or H or the CRS branch of H. 

Step 3:  Identify the presence or absence of HVR2 markers on the map

Marie Antoinette had four markers in the HVR2 region, namely, 152, 194, 263, and 315.1.  On the map, all HVR2 markers are in red.  Starting from the CRS, move outwards and cross off all of the HVR2 markers that Marie Antoinette does not have and circle the HVR2 markers that she has:

The results of the HVR2 test helps to eliminates Haplogroup R, and shows that Marie Antoinette must belong to either Haplogroup, Pre-HV, or HV, or H or the CRS branch of H.  Marie Antoinette has marker 263 which brings her away from CRS.  However, the presence of 263 cannot eliminate her from CRS either (click here to view the H subclade map), as such a conclusion would require more data from the coding region. 

In conclusion, the results of the scientific studies show that Marie Antoinette can belong to any one of the following Haplogroups:

  • H
  • HV
  • Pre-HV

To confirm whether Marie Antoinette is really a member of haplogroup H, one would need to test the coding region of her mtDNA to confirm that she does not have markers 7028 and 14766.  However, no coding region data was ever available for Marie Antoinette or for any of her living or decease relatives, so there is no way to confirm that she is a member of Haplogroup H.  The absence of coding region data is not unexpected since the scientists in the original study were never aiming to determine her haplogroup, so the coding region was never investigated. 

In conclusion, it is premature to conclude that Marie Antoinette is a member of haplogroup H, but the data from the studies to date do serve to narrow down her possible haplogroups to H, HV, and Pre-HV.  Further studies would need to focus on markers 7028 and 14766 in the coding region which would provide a determination of which haplogroup Marie Antoinette actually belongs to, and if she does in fact belong to haplogroup H, then Haplogroup H subclade testing should confirm which branch of H she falls into. 

May 02 2008

The mtDNA and its role in Ancestry: Part XIII (Distribution of Subclades of H)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H  <<– you are here
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In the previous blog, we talked about the Subclades of mtDNA Haplogroup H and discussed what is currently known about the Subclades.  In this blog, we will discuss what is known about the geographical distribution pattern of the Subclades of H based on the latest peer reviewed research data and provide a summary reference table and map for the Subclades of H which you can download and print.

Geographical Distribution of the Subclades of H

The following reference table summarizes what is known today about the geographical distribution of the Subclades of Haplogroup H.  The studies were conducted by sampling the DNA of indigenous populations from around Europe, the Caucasus and Central Asia and determining which percentage belonged to Haplogroup H versus other Haplogroup types.  For individuals who were confirmed to belong to Haplogroup H, further analysis was performed in the coding region of the mtDNA to determine which Subclade of H they belonged to in order to derive an understanding of the geographical distribution pattern of the individual Subclades of H.

As more data on the Subclades of H become available, this table will be updated.

The following reference map illustrates how Haplogroup H is distributed throughout Europe and also summarizes the distribution pattern of the Subclades of H.

In the next blog, we will wrap up our mtDNA discussion by looking at some notable people in history who belonged to Haplogroup H.

Apr 27 2008

The mtDNA and its role in Ancestry: Part XII (Subclades of mtDNA Haplogroup H)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H  <<– you are here
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In the previous blog, we talked about mtDNA Haplogroup H and introduced the “Subclades of H”.  In this blog, we will discuss what is currently known about the subclades of H. 

Let’s take a closer look at the Subclades of H

Due to the large size of Haplogroup H and its wide distribution, there has been much research recently on the sub-clades of H, which surprisingly shows more restricted and regional geographic distributions. 

A study of the mtDNA of Haplogroup H individuals by examining the HVR1, HVR2 and control region of the mtDNA reveals a very large number of independent sub-branches, giving rise to subclades which have several further sub-branches themselves.  Studies to date reveal defined geographical patterns for Subclades H1 and H3.  All other subclades of H are found at a lower frequency and studies to reveal detectable geographic patterns are still ongoing. 

The following table summarizes what is known today about the subclades of Haplogroup H: (this table is based on a summary of current research published in peer reviewed journals and will be updated as more scientific data becomes available for the subclades of H)

Subclade Description
H1 (Western Europe and Slavic speaking East Europeans, H1a and H1b are found almost exclusively in Europe, with only traces found in Turks outside Europe)
  1. Largest subclade (branch) of mtDNA Haplogroup H.
  2. Represents approximately 30% of people in Haplogroup H.  13% of the total European mtDNA pool belongs to this branch.
  3. Originated in Western Europe. 
  4. Almost exclusively European.
  5. Approximately 13,000 years old.
  6. Highest frequency in Western Europe, having expanded after the LGM from the Franco-cantabrian refugium. 
  7. Spread northwards fairly close to the Atlantic coastline into the British Isles.
  8. H1 is most frequent in the Iberian Peninsula
  9. Centered in Iberia and surrounding areas, covering about 46% of local Haplogroup H lineages.  In the Near East, frequency of H1 does not exceed 6% of all Haplogroup H subclades
  10. Has a frequency peak among the maternal lineages of the Basques of Spain (27.8%).  Very high frequencies in the rest of Iberia (17.7% to 24.3%), Morocco (19.2%) and Sardinia (17.9%)
  11. Decreasing in frequency toward the northeast and southeast. 
  12. Overall gradient with peak centered at the most southwestern edge of Europe and declining frequencies towards both the northeast and southeast.
  13. The distribution of H1 provides further confirmation regarding the origins of Europeans, suggesting that the Franco-Cantabrian refuge area was the source of late-glacial expansion of hunter-gatherers that repopulated much of Central and Northern Europe from approximately 13,000 years ago.
  14. The H1b sub-branch is detected at the highest frequency in Eastern Europe (7% of haplogroup H), and North Central Europe (5% of H).  It is also found in 5% of Haplogroup H individuals in Siberian Mansis.
  15. The H1f sub-branch is a rare subclade that is found in northern central Finns.  Makes up to 25% of the Finish Haplogroup H genomes of Finnila, Lehtonen, and Majamaa and absent elsewhere in Europe.
H2 (Eastern Europe, Caucasus, Asia)
  1. H2 is found in the highest frequency in Eastern Europe and the Caucasus.
  2. Most likely dispersed from Western Europe. 
  3. H2a is found most frequently in Eastern Europe and at low frequency in Western Europe (6.5% in Eastern and 1.1% in Western European Haplogroup H genomes).  Unlike its ancestor, H2, which did not spread to Asia, the spread of H2a extends to Central Asia and mimics the phylogeography of Y-chromosomal R1a. 
  4. The H2a1 branch, which is defined by a transition at position 951 is found most frequently in Eastern Europe as well as Asia.
  5. Coalescence age falls to the period of postglacial recolonization of Europe.
  6. The mtDNA bearing the “St. Luke” motif 16235, 16291 (vernesi et al. 2001) belongs to subclade H2 and is most frequent in Germany and Scotland.  The St. Luke motif is the mtDNA motif of “Luke the Evangelist”.  Click here to download and print an original copy of the publication.
H3 (Western Europe)
  1. H3 is the second most common subclade of H.
  2. Haplogroup H gave rise to subclade H3 in Western Europe.
  3. H3 is found more frequently in the Western Europe than in the Eastern Europe.  Mainly localized in Western Europe (11.7%), having expanded after the LGM from the Franco-cantabrian refugium. 
  4. H3 is extremely rare in Eastern Europe and virtually absent in Anatolia and the Near East.  Resembles the phylogeography of the spread of Y-chromosomal Haplogroup R1b. 
  5. Its highest frequency is in the Iberian Peninsula, 17% of haplogroup H, the highest detected so far.
  6. Spread northwards fairly close to the Atlantic coastline into the British Isles.
  7. Similar pattern to H1.
  8. 9000 to 11,000 years old.
  9. The distribution of H1 provides further confirmation regarding the origins of Europeans, suggesting that the Franco-Cantabrian refuge areas was the source of late-glacial expansion of hunter-gatherers that repopulated much of Central and Northern Europe from approximately 15,000 years ago.
  10. H3 represents a much smaller fraction of H than H1. 
  11. High frequencies found among the Basques of Spain (13.9%), in Galicia (8.3%) and Sardinia (8.5%) (in the same areas where H1 is also most frequent).
H4
  1. Infrequent/minor subclade of H.
  2. Found in low frequencies in Europe and the Near East and also present in the Caucasus.
  3. Entered Europe around the LGM and/or during later dispersal from the near East, such as the Neolithic.
  4. Further research will better resolve the geographical distribution pattern of this subclade.
H5
  1. H5 is present at low levels throughout the Near East and may have evolved there, spreading later into Europe. 
  2. 11,500 years old.  Its ancestor was identified as a putative late-glacial founder type.  Found in Iberia, central, eastern and southeast Europe and less frequent in northwest Europe.
  3. H5a is found in the highest frequency on the central European plain and dates to 7000 to 8000 years.  Fairly evenly distributed at low levels across Europe, but is absent from the Caucasus and the Near East, suggesting an European origin for this branch.  Distributed by late-glacial dispersals, may trace a distinct dispersal route into Central and Eastern Europe. 
H6 (central Asia)
  1. Very old branch of H, first arose approximately 40,000 years ago, likely diverged from H in Central Asia.
  2. Although H6 is likely one of the oldest subclades of Haplogroup H, it is one of the latest subclades to expand into Europe, likely migrated into Europe approximately 5000 years ago.
  3. Diverged to Asia very early.  Coalescence age of H6 in Central Asians is very deep, 40,000 years.  Asian branch of H6 is highly divergent and seems to among the oldest in Haplogroup H.
  4. Flow of maternal lineages south of the Caspian and the Black Sea in addition to well known long lasting migrations of pastoral nomads alongside the steppe belt that connects the Danube Basin, over the Pontic-Caspian, with Central Asia, Altay, and Manchuria.
  5. Most frequent among the identified subclades of Haplogroup H in Central Asia.
  6. Common in Eastern Europe and the Caucasus (may have dispersed from western Europe).
  7. The basal type of H6 is exclusively European and there is a single derived type that is common in Eastern Europe and the Caucasus.
  8. Not found in Near Eastern samples.  
  9. The H6b branch is very rare in Europe.  Found in Central Asian/Altaian Populations.  Very distinctly divergent haplotypes.  Reflects a long-time separation of Asian and European H6 mtDNA or an earlier expansion of H6 in the eastern part to its present range. 
H7
  1. Infrequent/minor subclade
  2. Found in Europe and the Near East and also present in the Caucasus.
  3. Entered Europe around the LGM and/or during later dispersal from the near East, such as the Neolithic.
  4. Further research will reveal the history and geographical distribution pattern.
H8
  1. H8 is found mainly in Near East and Central Asia.
  2. Very rare in Europe.
  3. Flow of maternal lineages south of the Caspian and the Black Sea in addition to well known long lasting migrations of pastoral nomads alongside the steppe belt that connects the Danube Basin, over the Pontic-Caspian, with Central Asia, Altay, and Manchuria.
  4. Found in Central Asian/Altaian Populations.  Very distinctly divergent haplotypes.  Reflects a long-time separation of Asian and European H8 mtDNA or an earlier expansion of H8 in the eastern part to its present range. 
H9
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H10
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H11
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H12
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H13
  1. Infrequent/minor subclade
  2. Found in low frequencies in Europe and the Near East and also present in the Caucasus.
  3. Entered Europe around the LGM and/or during later dispersal from the near East, such as the Neolithic.
  4. Highest frequency in Daghestan and in Georgia.
  5. Present in the southern Caucasus and Near East populations
H14
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H15
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.
H16
  1. Infrequent/minor subclade of H.  Further research will better resolve the geographical distribution pattern of this subclade.

In the next blog, we will begin to wrap up our discussion of mtDNA Haplogroup H by providing a detailed Haplogroup H distribution map.

Apr 20 2008

The mtDNA and its role in Ancestry: Part XI (mtDNA Haplogroup H)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H  <<– you are here
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In the last blog, we introduced “Subclades”, which are the finer branches of the Haplogroup tree.  One of the most well studied is mtDNA Haplogroup H.  In the next few blogs, we will discuss the latest research findings for mtDNA Haplogroup H and its subclades.

Who’s who in the field of Haplogroup H Research

Studies have shown that Europeans fall into one of several main mtDNA haplogroups:  H, I, J, K, N1, T, U2e, U3, Ur, X, W, U5, and V.  Haplogroup H is one of the most dominant family groups in Europe, representing approximately 40% of the mtDNA gene pool in populations in various parts of Europe and extending as far as western Asia.  Recent publications by the following researchers have provided significant advances in our understanding of Haplogroup H and its subclades:

  • Roostalu et al. from the University of Tartu and Estonian Biocentre, Estonia
  • Brandstatter et al. from the Innsbruck Medical University, Austria
  • Pereira et al. from Universidade to Porto, Portugal
  • Grignani et al. from Universita di Pavia, Italy
  • Loogvali et al. from University of Tartu, Estonia
  • Achilli et al. from Universita di Pavia, Italy

Table 1: Top peer reviewed research publications for mtDNA Haplogroup H 

This table lists the most significant papers for Haplogroup H in peer reviewed journals, with links to access the original publications.  These papers have provided significant advances in the current understanding of Haplogroup H and form the basis for the Haplogroup H subclade test panel. 

Name of Scientific Article Scientific Journal
Origin and expansion of haplogroup H, the dominant human mitochondrial DNA lineage in West Eurasia: the Near Eastern and Caucasian perspective.  Roostalu U. et al Click here to view and download a copy of the original publication Mol Biol Evol. 2007 Feb;24(2):436-48.
Dissection of mitochondrial superhaplogroup H using coding region SNPs.  Brandstätter A et al  Click here to read abstract Electrophoresis. 2006 Jul;27(13):2541-50.
Evaluating the forensic informativeness of mtDNA haplogroup H sub-typing on Eurasian scale.  Pereira L et al Click here to read abstract Forensic Sci Int. 2006 May 25;159(1):43-50. Epub 2005 Aug 1.
Subtyping mtDNA haplogroup H by SnaPshot minisequencing and its application in forensic individual identification.  Grignani P et al Click here to read abstract Int J Legal Med. 2006 May;120(3):151-6. Epub 2005 Dec 7.
High-resolution mtDNA evidence for the late-glacial resettlement of Europe from an Iberian refugium.  Pereira L et al Click here to view and download a copy of the original publication Genome Res. 2005 Jan;15(1):19-24.
Disuniting uniformity: a pied cladistic canvas of mtDNA haplogroup H in Eurasia.  Loogväli EL et al  Click here to view and download a copy of the original publication Mol Biol Evol. 2004 Nov;21(11):2012-21. Epub 2004 Jul 14.
The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool.  Achilli A et al Click here to view and download a copy of the original publication Am J Hum Genet. 2004 Nov;75(5):910-8. Epub 2004 Sep 20.

Let’s summarize the peer reviewed findings to date for Haplogroup H:

Several of the recent papers aim to provide resolution for the distribution of Haplogroup H and its subclades and begin to answer the fundamental questions of the origins of Haplogroup H:  where did it come from and where is it most concentrated in Eurasia?

To follow is a summary of what is currently known about Haplogroup H.  As more details are confirmed, this list will be updated: 

Category What is currently known about Haplogroup H
Origins Haplogroup H originated in Near and Middle east prior to 30,000 years ago and expanded within the Near East 25,000 to 30,000 years ago.
Haplogroup H first entered Europe 20,000 to 25,000 years ago (at the peak of the ice age) in association with a second Paleolithic wave (possibly contemporary with the diffusion of the Gravettian technology 20,000 to 25,000 years ago).
19,000 to 22,000 years ago during the last glacial maximum (LGM), the climate became significantly colder and dryer.  During this cold peak, extreme deserts occupied most of Europe and Northern Asia was covered by steppe-tundra, forcing early the Paleolithic populations, consisting of Haplogroup H ancestors of Northern and Central Europe to retreat to the south, to the refugium areas in the western Caucasus and southern European peninsulas.
15,000 years ago, climatic conditions improved.  Haplogroup H was strongly involved in the late-glacial expansion from ice-age refugia after the LGM.
Due to its high frequency and wide distribution, Haplogroup H is implicated to have participated in all subsequent episodes of putative gene flow in western Eurasia (such as the Neolithic diffusion of agriculture from the Near East, the expansion of the Kurgan culture from southern Ukraine, the recent events of gene flow to northern India).
Distribution Wide geographic distribution
Haplogroup H is the most common and frequent haplogroup in European Caucasian populations (western Eurasia)
High frequency.  Haplogroup H accounts for approximately 40% of total mtDNA pool variation for most of Europe.  Most prevalent haplogroup in all European populations except the Saami
Haplogroup H exhibits a characteristic distribution pattern of Northwest to Southeast.  The frequency of Haplogroup H is highest in the Northwest and declines towards East and South, reaching 20% in the Near East and Caucasus, <10% in the Gulf and 5% to 10% in Northern India and Central Asia.

Despite the broad geographic distribution pattern of Haplogroup H, further investigation of each Subclade of Haplogroup H provides further resolution and reveals a much more specific and distinctive distribution pattern for each Subclade of Haplogroup H.

Haplogroup H is the most prominent European Haplogroup, and the papers were able to successfully sub-classify members in Haplogroup H into subclades H1 to H16 based on characteristic SNPs in the mtDNA, many of which are located in the coding region.  Next, let’s talk about the Subclades of Haplogroup H.

Subclades of Haplogroup H

The sub-clades of Haplogroup H surprisingly show more restricted and disctinctive regional geographic distributions. 

Based on the recent papers for defining the subclades of Haplogroup H, a subclade test panel is now available to test for sub-clades H1 to H16.  Individuals who are confirmed to belong to the Haplogroup H family can now take the H Subclade SNP Test to find out which sub-clade of Haplogroup H they belong to. 

Subclades H1 to H16 account for over 70% of individuals who belong to Haplogroup H.  The remaining 30% of individuals who belong to Haplogroup H belong to yet unidentified subclades of Haplogroup H.  As the studies progress, more subclades of H will be identified and will provide further classification.

The subclades of Haplogroup H can be defined by the following panel of coding region SNPs:

Location of SNP Mutation Haplogroup H Subclade
1438 A>G H2
2259 C>T H13
3010 G>A H1
3915 G>A H6
3936 C>T H12
3992 C>T H4
4310 A>G H9
4336 T>C H5a
4769 A>G H2a
4793 A>G H7
6253 T>C H15
6776 T>C H3
8448 T>C H11
10394 C>T H16
11377 G>A H14
13101 A>C H8
14470 T>A H10

The mtDNA Haplogroup H Subclade Tree:

The 16 subclades of Haplogroup H can be summarized in the following phylogenetic tree.  Click here to view a more detailed version of the mtDNA Haplogroup H Subclade Tree.

 

In the next blog, we will discuss the features and distribution pattern of each subclade of mtDNA Haplogroup H.

Apr 15 2008

The mtDNA and its role in Ancestry: Part X (mtDNA Subclades)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades  <<– you are here
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In mtDNA Part IX, we discussed how to use your mutations to determine your Haplogroup.  In this section, we will show you how you can trace your ancestry further by using your mutations to determine your “Subclade”. 

What are Subclades?

All people living today can trace their maternal ancestry back to one of 26 core mtDNA Haplogroups.  Haplogroups are the main “trunks” of the mtDNA phylogenetic tree and represent extremely ancient family groups which arose tens of thousands of years ago.  Over time, the descendents of each Haplogroup formed further subgroups, called “Subclades”.  Once you discover which Haplogroup you belong to, you can further fine tune your results by tracing which sub-branch of your Haplogroup you belong to through “Subclade” analysis. 

Subclades are named using numbers and letters.  For example, Subclades of Haplogroup H include H1, H2, H3, H4,…. and so on.  The Subclade H2 can be further classified as H2a, H2b, etc.  Similarly, the Subclade H5 can be further classified as H5a, H5b, etc.

Which mtDNA Subclade tests are available? 

Subclade testing will be launched over the next few months.  Refer to the following chart for the launch dates of upcoming mtDNA Subclade test panels.  Once the subclade test is available, it will appear automatically as an upgrade option from your control panel.

mtDNA Subclade Test Type # of SNPs included in test panel Launch Date
Haplogroup H Subclade Test 17 July 21, 2008
Haplogroup M Subclade Test 19 TBA
Haplogroup R Subclade Test 12 TBA

Please remember the following rules for subclade testing:

  1. You can only take a subclade test if your Haplogroup is known (either confirmed, or strong prediction)
  2. You can only take a subclade test for your own Haplogroup.  For example, if you belong to Haplogroup T, you will not qualify for Haplogroup H subclade testing. 
  3. If a Subclade test is not yet available for your subclade, check back occasionally as the control panel is updated as the science progresses.  Once more information is known about your Haplogroup and its Subclades, it will automatically be announced here.

Coming up next, H Subclades!

Since the next Subclade test to become available is the mtDNA Haplogroup H Subclade SNP Test, in the next blog, we will discuss Haplogroup H, and provide an overview of current scientific research and discoveries for Haplogroup H. 

In preparation for the next blog, we recommend downloading and printing the Haplogroup H Subclade Map.  You will need this map when we talk about Haplogroup H subclades in the next blog

Apr 09 2008

The mtDNA and its role in Ancestry: Part IX (mtDNA Haplogroup Determination)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types
mtDNA Part IX - mtDNA Haplogroup Determination  <<– you are here
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In mtDNA Part VI, we discussed two ways that mtDNA testing can be used to trace ancestry: 

  1. Direct Comparisons - search for matches, confirm or refute findings from ancestral studies
  2. Ancestral Tracking - tracing your deep ancestry through “haplogroup” determination

 In today’s blog, we will focus on #2 “Ancestral Tracking”, and show you the science behind how the mutations in your mtDNA are used to determine your haplogroup “deep ancestry”.  We will always automatically predict your haplogroup for you after you take the mtDNA test.  However, the more you understand the science behind the technology, the more you will get out of your genetic genealogy experience.  New studies and data become available all the time in this fast moving and exciting field, so it is a good idea to know the basics behind how the technology works.

Let’s begin with a basic step-by-step guide on how to use your mutations in your mtDNA to determine your mtDNA haplogroup:

Step #1:  Download and print the Haplogroup Reference Guide Click here to download and print the mtDNA Haplogroup Reference Guide.  This guide is a valuable reference tool for you.
Step #2:  Gather your mutations Determine which mtDNA tests you have taken and which ones you did not take yet.

  • The HVR1 Test will show you all mutations between 16000 to 16400 (shown in blue).
  • The HVR2 Test will show you all mutations between 1 to 400 (shown in red).
  • The SNP Haplogroup Backbone Test will show you all relevant mutations in the Coding Region (shown in black).

 

Step #3:  Identify your mutations on the Reference Map Your mtDNA results report will always include a mutation table.  Examine your mutations table and circle your mutations on the Reference Map.

  • Starting from the CRS, circle the mutations that you have. 
  • Follow the path of your mutations away from the CRS. 
  • Ignore the markers that you did not test.
  • The final destination of the path mutations is the haplogroup that you belong to.

Let’s take a look at an example for a sample individual, Scott Mckenzie:

Let’s assume that Scott has only taken the HVR1 Test.  Scott has not yet taken the HVR2 or SNP Haplogroup Backbone Tests.  This is an example of an actual mutation table for Scott’s HVR1 Test:

Let’s see if we can tell which haplogroup Scott belongs to by examining the mutations in his HVR1 region.

1.  Find the CRS on the Reference Guide.  The first marker you encounter when moving away from CRS is “263″.  263 is located in the HVR2 region (HVR2 includes locations 1 to 400), so it is included in the HVR2 Test.

  • Tip #1:  If you took the HVR2 Test, then look at your HVR2 mutation results and see if you have a mutation at location 263.  If you have a mutation at 263, then you can trace your ancestry away from CRS towards “Haplogroup H”.  If you do not have a mutation at 263, then you stay within CRS, and your haplogroup is likely the same as CRS.
  • Tip #2:  If you did NOT take the HVR2 Test, then ignore 263 and move on to the next marker.

Scott did not take the HVR2 Test, so we can skip this marker for now because we do not know whether Scott has a mutation at 263.

2.  The next marker is “7028″.  7028 is located in the Coding Region, so it is included in the SNP Haplogroup Backbone Test.

  • Tip #1:  If you took the SNP Haplogroup Backbone Test, look at your results and see if you have a mutation at location 7028.  If you have a mutation at 7028, then you can trace your ancestry away from Haplogroup H towards “Haplogroup HV”.  If you do not have a mutation at 7028, but you do carry a mutation at 263, then you belong to Haplogroup H. 
  • Tip #2:  If you didn’t take the SNP Haplogroup Backbone Test, then ignore 7028 and move on to the next marker.

Scott did not take the SNP Haplogroup Backbone Test yet, so we can skip this marker for now because we do not know if Scott has a mutation at 7028.

3.  Next are markers 14766, 16067, 16298 and 72:

14766 leads to Haplogroup Pre-HV.  14766 is located in the Coding region and included in the SNP Haplogroup Backbone Test.

  • Tip:  If you carry the 14766 marker, but not 16067, 16298 or 72, then you can trace your ancestry away from Haplogroup HV towards Haplogroup Pre-HV.

Scott did not take the SNP Haplogroup Backbone Test yet, so we do not know if he has a mutation at 14766.

16067 leads to Haplogroup HV1.  16067 is located in the the HVR1 region (HVR1 includes locations 16000 to 16400), so it is included in the HVR1 Test.

  • Tip:  If you carry the 16067 marker, but not 14766, 16298 or 72, then you can trace your ancestry away from Haplogroup HV towards Haplogroup HV1.

Scott’s mtDNA HVR1 results indicate that he does not have a mutation at location 16067, so he is unlikely to belong to Haplogroup HV1.

16298 and 72 lead to Haplogroup Pre-V.  16298 is located in the HVR1 region so it is included in the HVR1 Test.  72 is located in the HVR2 region so it is included in the HVR2 Test. 

  • Tip:  If you carry a mutation at 16298 and 72, but not 14766 or 16067, then you can trace your ancestry away from Haplogroup HV towards Haplogroup Pre-V.

Scott’s mtDNA HVR1 results indicate that he does not have a mutation at 16298, so he is unlikely to belong to Haplogroup Pre-V.

Scott did not take the HVR2 test, so we do not know if he has a mutation at 72, but based on the HVR1 results, we can predict that he is unlikely to belong to Haplogroup Pre-V.

In summary, based on a process of elimination (HV1 and Pre-V are eliminated), Scott’s ancestry can be traced through to Haplogroup Pre-HV.

4.  Next are markers 11719, 73, 16126, and 16362:

11719 and 73 lead towards Haplogroup “R”.  Both 11719 and 73 are located in the coding region.  Scott did not take the SNP Haplogroup Backbone Test, so we do not know if he has mutations at 11719 and 73.

16126 and 16362 lead to Haplogroup “Pre-HV1″ and they are included in the HVR1 Test.  Scott’s mtDNA HVR1 results indicate that he has a mutation at 16126, but does not have the 16362 mutation.  It is remotely possible that Scott might belong to Haplogroup “Pre-HV1″, and had a “back-mutation” at location 16362 (we will explain back-mutations in another blog).  Back mutations are rare, but it would explain why Scott has 16126 but not 16362.  We should hold onto Pre-HV1 and continue tracing to see if there is a better match for Scott.

5.  Next are markers 11251, 16126, 16278, 16311, 16071, 151, 12308, 16189, 10310, 16304, 249:

11251 and 16126 lead to Haplogroup JT.  11251 is located in the coding region, and 16126 is located in the HVR1 region.

Scott did not take the SNP Haplogroup Backbone Test, so we do not know if he carries the 11251 mutation.  Scott’s HVR1 results indicate that he carries the 16126 mutation so we can move through to Haplogroup JT.

16278 and 16311 lead to Haplogroup R1.  Scott does not have either of these mutations, so he is unlikely to belong to Haplogroup R1.

16071 and 152 lead to Haplogroup R2.  Scott does not have a mutation at 16071 so he is unlikely to belong to Haplogroup R2.  Scott did not take the HVR2 test, so we do not know if he has a mutation at 152. 

12308 leads to Haplogroup U.  Scott did not take the Haplogroup Backbone test, so we do not know if he belongs to haplogroup U, but we can look beyond Haplogroup U to the U Subclades to see if he falls into any of the Subclades of Haplogroup U:

  • 16249 and 285 lead to Subclade U1.  Scott does not have a mutation at 16249 so he is unlikely to belong to U1.
  • 16051 and 16219 lead to U2.  Scott does not have a mutation at either locations, so he is unlikely to belong to U2.
  • 16343 and 150 lead to U3.  Scott does not have a mutation at 16343 so he is unlikely to belong to U3. 
  • 16356 and 195 lead to U4.  Scott does not have a mutation at 16356 so he is unlikely to beong to U4.
  • 16270 leads to U5.  Scott does not have a mutation at 16270 so he is unlikely to belong to U5.
  • 16172 and 16219 lead to U6.  Scott does not have either mutations so he is unlikely to belong to U6. 
  • 16318 leads to U7.  Scott does not have a mutation at 16318, so he is unlikely to belong to U7.
  • 16224 and 16311 lead to K.  Scott does not have a mutation at either 16224 or 16311 so he is unlikely to belong to K.

Thus, we can conclude that Scott does not belong to any of the Subclades of the Haplogroup U family.  If the Haplogroup Backbone Test was conducted, we could confirm that Scott does not carry 12308 and eliminate him Haplogroup U entirely. 

6.  Next are markers 16069 and 16294:

16069 leads to Haplogroup J.  16069 is included in the HVR1 Test.  Scott’s results show that he does not have a mutation at 16069, so he is unlikely to belong to Haplogroup J.

16294 leads to Haplogroup T.  16294 is included in the HVR1 Test.  Scott’s results show that he has a mutation at 16294, so we can move through to Haplogroup T.

7.  Next are markers 16163, 16186, 16189, 16304, and 16324:

16163, 16186 and 16189 lead to Subclade T1 (a branch of Haplogroup T).  All 3 markers are included in the HVR1 Test.  Scott’s results show that he has a mutation at all 3 locations, suggesting that he belongs to Haplogroup T1.

16304 leads to Subclade T2.  This marker is included in the HVR1 Test.  Scott’s results show that he does not have a mutation at 16304, so he is unlikely to belong to Subclade T2.

16324 leads to Subclade T4.  This marker is included in the HVR1 Test.  Scott’s results show that he does not have a mutation at 16324, so he is unlikely to belong to Subclade T4.

In summary, based on the results of the HVR1 Test, Scott most likely belongs to Haplogroup T1. 

The prediction strength is strong for the following reasons:

  • There are multiple markers in the HVR1 region that lead from CRS to Haplogroup T1.  HVR1 Testing indicates that Scott carries all of the mutations leading from CRS to Haplogroup T1.
  • By process of elimination, most of the other haplogroups have been eliminated, with the exception of Haplogroup B.  However, due to the large number of matching markers for Haplogroup T1, it is most likely that Scott belongs to Haplogroup T1.  This prediction can be further strengthened if Scott takes the SNP Haplogroup Backbone Test which includes marker 11251 which is specific for Haplogroup JT branch of the haplogroup tree.  If Scott is positive for a mutation at 11251, then it would further confirm that Scott belongs to Subclade T1.

The study also showed a single match at 16126 for Haplogroup Pre-HV1, but Scott does not have the 16362 marker which is usually found in Haplogroup Pre-HV1.  Due to the stronger match for Subclade T1, it is far more likely that Scott belongs to Haplogroup T1 rather than Haplogroup Pre-HV1.  To further confirm this, the SNP Backbone Haplogroup Test will confirm whether Scott carries the mutation at 11719.  The HVR2 test will confirm whether Scott carries the mutation at 73 which leads away from Pre-HV1 towards the direction of Haplogroup T. 

Haplogroup Confirmation:

If Scott indeed belongs to Subclade T1, as suggested thus far by the HVR1 test, then the HVR2 Test and the SNP Haplogroup Backbone Test will further confirm the following:

  • The results of Scott’s SNP Haplogroup Backbone Test are expected to show positive mutations for the following markers:  7028, 14766, 11719 and 11251.
  • The results of Scott’s HVR2 Test are expected show positive mutations for the following markers:  263, 73.

As you can see from this example, Scott’s haplogroup is predicted using his HVR1 results.  In his case, the prediction was quite strong because Haplogroup T1 contains a lot of mutations from HVR1.  The prediction can be confirmed with the HVR2 test and SNP Haplogroup Backbone Test.

We hope that this lesson will give you a good insight into how mutations are used to determine an individual’s haplogroup.  In the next blog, we will dig deeper into the use of mutations for deep ancestral analysis. 

Apr 05 2008

The mtDNA and its role in Ancestry: Part VIII (mtDNA Test Types)

DNA Lesson Series: The mtDNA and its role in Ancestry
mtDNA Part I - mtDNA 101
mtDNA Part II - Facts about mtDNA
mtDNA Part III - mtDNA Structure
mtDNA Part IV - Ancestral Markers
mtDNA Part V - Detecting Mutations in the mtDNA
mtDNA Part VI - mtDNA Ancestral Markers
mtDNA Part VII - The Cambridge Reference Sequence
mtDNA Part VIII - mtDNA Test Types  <<– you are here
mtDNA Part IX - mtDNA Haplogroup Determination
mtDNA Part X - mtDNA Subclades
mtDNA Part XI - mtDNA Haplogroup H
mtDNA Part XII - Subclades of mtDNA Haplogroup H
mtDNA Part XIII - Distribution of Subclades of H
mtDNA Part XIV - Descendents of Maria-Theresa
mtDNA Part XV - Luke the Evangelist
mtDNA Part XVI - Empress Feodorovna
mtDNA Part XVII - James “Earthquake McGoon” McGovern

In this blog, we will provide a broad overview of the different types of mtDNA tests available, and in the next blog, we go over some case studies to help you understand when they are used, and what each one will tell you about your ancestry.

DNA Testing 101:  There are 4 main types of mtDNA tests:

  1. HVR1 Test
  2. HVR2 Test
  3. SNP Haplogroup Backbone Test
  4. SNP Subclade Test

Let’s take a look at the main differences between these test types:

Test Type #1:  HVR1 Test

Overview:  The HVR1 Test is the most informative mtDNA test, and it is always the first test that is performed when you start tracing your maternal ancestry (the Maternal mtDNA test). The HVR1 test uses “DNA sequencing” technology to read all of the nucleotides from locations 16,000 to 16,400 of your mtDNA.  This is the entire HVR1 region, located in the D-Loop of the mtDNA.

Highlights of the HVR1 Test:  The HVR1 region is considered the most informative region of the mtDNA for ancestral studies for a number of reasons:

  1. The HVR1 region contains an abundance of markers.  The HVR1 region is located in the D-Loop, so it contains an extremely high concentration of mutations (ancestral markers).  Thus, this region is highly informative.
  2. The HVR1 region is easy to test.  The entire HVR1 region can be easily tested using sequencing technology.  All 400 nucleotides in the entire HVR1 region can be read from a single test.
  3. The HVR1 region is well studied.  The HVR1 region is the most well studied region of the mtDNA due to its high concentration of mutations and ease of testing.  Most scientific studies to date, including indigenous data and other anthropological studies, have focused mainly on the HVR1 region.  Thus, there is more scientific data available for markers in the HVR1 region than any other region of the mtDNA, making the HVR1 region by far the most informative region of the mtDNA.

Prerequisite:  There is no prerequisite for taking the HVR1 test.  The HVR1 test is aways the first and most fundamental test that is performed when using mtDNA to trace ancestry.  The HVR1 test can be used “stand-alone” for User search and comparisons and haplogroup predictions.  All of the other test types serve to supplement the results of the HVR1 test.  There are usually enough markers and hypervariablity within the HVR1 region alone to differentiate between different individuals who are not part of the same haplogroup or family line.  Also, due to the large number of highly informative markers found in the HVR1 region, your haplogroup can often be predicted from examing just the markers in your HVR1 region. 

Test Type #2:  HVR2 Test

Overview:  Like HVR1 testing, the HVR2 test also uses “DNA sequencing” technology.  This test focuses on reading all of the nucleotides from locations 1 to 400 of the mtDNA.  This is the entire HVR2 region, the second most important region in the D-loop of the mtDNA. 

Highlights of the HVR2 Test:  Like HVR1, the HVR2 region is also located in the D-Loop of the mtDNA so it contains many ancestral markers.  The HVR2 region is always tested in conjunction with or subsequent to HVR1 Testing.  The HVR2 test will supplement the HVR1 results in the following ways:

  1. Strengthen the results of User searches and comparisons.  The more regions of the mtDNA that are used for comparison, the more stringent and precise the results of the comparison.  For example, in cases where people are matching exactly at the HVR1 region, further comparison of the HVR2 region will provide greater resolution, and further confirm or refute the matches found in the HVR1 region. 
  2. Assist in haplogroup prediction.  The HVR2 region contains quite a few markers which are important for haplogroup determination, and when used in conjunction with HVR1, will provide a stronger haplogroup prediction (we will examine a case study in the next blog).

Prerequisite:  The HVR1 test is a prerequisite for taking the HVR2 test.  The HVR2 region is rich in ancestral markers and HVR2 testing is an excellent way to supplement the results of the HVR1 Test.

Test Type #3:  SNP Haplogroup Backbone Test

Overview:  While the markers in the HVR1 and HVR2 Regions can be easily detected by sequencing, the Coding Region is extremely large, making sequencing impractical.  The SNP Haplogroup Backbone Test is a special panel of approximately 20 markers in the Coding Region which are specific for haplogroup determination.  By testing just the HVR1 or HVR2, you will only receive part of the “picture” of your mtDNA.  The SNP Haplogroup Backbone Test will allow you to view markers in the Coding Region of your mtDNA.

The following table illustrates all of the SNPs that are examined in the SNP Haplogroup Backbone Test:

SNP Location Mutations
2352 T > C
3594 C > T
3693 G > A
4312 C > T
4580 G > A
4833 A > G
5178 C > A
C > T
7028 C > T
7055 A > C
A > G
7598 G > A
8618 T > C
10086 A > G
10310 G > A
10400 C > T
10873 T > C
11251 A > G
11719 G > A
12308 A > G
12705 C > T
14766 C > T

Prerequisite:  The HVR1 and HVR2 tests are prerequisites for the SNP Haplogroup Backbone Test.  The SNP Haplogroup Backbone test examines markers in the Coding Region of the mtDNA and together with results of the HVR1 and HVR2 test, will allow you to confirm your haplogroup.   

Test Type #4:  SNP Subclade Test

Overview:  The SNP Subclade Test examines a special panel of markers in the Coding Region of the mtDNA which allows you to determine which “sub-clade” you belong to once your Haplogroup has been determined.

Prerequisite:  The HVR1, HVR2, and SNP Backbone Tests are prerequisites for the SNP Subclade Test.  Your Haplogroup must be confirmed before you can proceed with SNP Subclade testing.  At the moment, SNP Subclade Tests are available for the following mtDNA Haplogroups:

  • R
  • M
  • H

If a subclade test is not available for your haplogroup, please check back often as new sub-clade tests are added occasionally.  Whenever a subclade test for your haplogroup becomes available, you will be able to access it from your control panel. 

The four types of mtDNA Tests are summarized in the following table:

Name of Test Technology Used Location Tested Region SNPs detected by this test type Prerequisite
HVR1 Test DNA Sequencing 16000 to 16400 HVR1 region of D-Loop This test will detect all SNPs which are between locations 16000 to 16400 None
HVR2 Test DNA Sequencing 1 to 400 HVR2 region of D-Loop This test will detect all SNPs which are between locations 1 to 400 Must have:HVR1 Test
mtDNA Backbone SNP Test SNP Panel Special panel of 10 to 20 SNPs Coding Region This test will detect 10 to 20 specially selected SNPs in the coding region which are known to provide information for Haplogroup determination Must have: HVR1 Test and HVR2 Test 
mtDNA Subclade SNP Test SNP Panel Special panel of 10 to 20 SNPs Coding Region This test will detect 10 to 20 specially selected SNPs located in the coding region which are known to provide information for subclade determination Must have HVR1 Test, HVR2 Test and mtDNA Backbone SNP Test.  Note:  Results of Backbone SNP Test must be available before Subclade testing can be ordered.  This is because there is a specific subclade test for each Haplogroup so unless your haplogroup has been confirmed, you will not know which sub-clade test to order.

The main goal of mtDNA testing is to gain a complete understanding of all of the important SNP mutations in your mtDNA.  The HVR1 test focuses on markers in the HVR1 region, the HVR2 test focuses on markers in the HVR2 region, and the SNP Test Panels focus on relevant markers in the Coding Region.  In the next blog, we will go over a case study to allow you to understand how these tests work together to uncover your ancestry.

Next »

Navigate Home | Family Tree | Friends | DNA Ancestry | Photos | Journals | Groups | My Profile
More Info Tutorials | News | Learning Center | Resources | About Genebase | Store | Applications | Terms
Get Help Help Center
© 2003 − 2008 Genebase Systems. All Rights Reserved.
Powered by WP.