Genebase Bionet Builder
Advertisement

The Genebase Learning Center

Archive for the 'Y-DNA STR Markers' Category

Jan 07 2009

Genebase Y-DNA STR Marker Table Version 3.5

Published by Genebaseteam under Y-DNA STR Markers

The following table outlines the Y-DNA STR markers available at Genebase and the corresponding motifs used for allele designation in Version 3.5. 

Click here to view the conversion chart for Version 3.3 to Version 3.4

Click here to view the conversion chart for Version 3.4 to Version 3.5

Please note that Users from previous versions will be automatically converted to 3.5.

Y-STR marker

Motif

Comparison to NIST nomenclature
DYS19 [TAGA]3tagg[TAGA]n same motif
DYS385a/b [aagg]6-7[GAAA]n same motif
DYS388 [ATT]n same motif
DYS389 l/ll I:  [TCTG]3[TCTA]n      II: [TCTG]n[TCTA]nN28  [TCTG]3[TCTA]n       same motif
DYS390 (tcta)2[TCTG]n[TCTA]n[TCTG]n [TCTA]ntca(tcta)2 same motif
DYS391 (tctg)3[TCTA]n same motif
DYS392 [TAT]n same motif
DYS393 [AGAT]n same motif
DYS413a/b [CA]n ?
DYS426 [GTT]n same motif
DYS434 (taat)1-2[CTAT]n same motif
DYS435 [TGGA]n same motif
DYS436 [GTT]n same motif
DYS437/DYS457 [TCTA]n[TCTG]n[TCTA]4 same motif
DYS438 [TTTTC]1[TTTTA]0-1[TTTTC]n same motif
DYS439 [GATA]n same motif
DYS441 [TTCC]n same motif
DYS442 (tatc]2(tgtc)3[TATC]n same motif
DYS444 [TAGA]n same motif
DYS445 [TTTA]n same motif
DYS446 [TCTCT]n same motif
DYS447 [TAATA]n[TAAAA]n[TAATA]n [TAAAA]n[TAATA]n same motif
DYS448 [AGAGAT]nN10(agagat)3N14 [AGAGAT]n same motif
DYS449 [TTTC]nN50[TTTC]n same motif
DYS450 [TTTTA]n same motif
DYS452 [TATAC]2[TGTAC]2[TATAC]n[CATAC]1[TATAC]1[CATAC]1  [TATAC]3-4[CATAC]0-2        [TATAC]0-3[CATAC]1[TATAC]3 same motif
DYS453 [AAAT]n same motif
DYS454 [AAAT]n same motif
DYS455 [AAAT]n same motif
DYS456 [AGAT]n same motif
DYS458 [GAAA]n same motif
DYS459a/b [TAAA]n same motif
DYS460 [ATAG]n same motif
DYS461 [TAGA]n[CAGA]n same motif
DYS462 [TATG]n same motif
DYS463 [AAAGG]n[AAGGG]n[AAGGA]2 same motif
DYS464a/b/c/d [CCTT]n same motif
DYS468 [TCC]n[TTC]n[(TTA)(TTC)]n[TTC]n ?
DYS472 [AAT]n ?
DYS481 [CTT]n same motif
DYS484 [AAT]n ?
DYS485 [TTA]n same motif
DYS487 [ATT]n ?
DYS490 [TTA]n same motif
DYS492 [ATT]n same motif
DYS494 [TAT]n ?
DYS495 [AAT]n same motif
DYS504 [TCCT]n same motif
DYS505 [TCCT]n same motif
DYS508 [TATC]n same motif
DYS511 [GATA]n ?
DYS518 [AAAG]n[GAAG]n[AAAG]n [GGAG]n[AAAG]n ?
DYS520 [ATAG]n[ATAC]n same motif
DYS522 [GATA]n same motif
DYS525 [TAGA]n same motif
DYS527a/b [GAAA]n[GGAA]n same motif
DYS531 [AAAT]n same motif
DYS532 [CTTT]n same motif
DYS533 [ATCT]n same motif
DYS534 [CTTT]n same motif
DYS537 [TCTA]n ?
DYS540 [TTAT]n same motif
DYS549 [AGAT] same motif
DYS556 [AATA]n same motif
DYS557 [TTTC]n same motif
DYS565 [TAAA]n same motif
DYS568 [AAAT]n ?
DYS570 [TTTC]n same motif
DYS572 [AAAT]n same motif
DYS575 [AAAT]n same motif
DYS576 [AAAG]n same motif
DYS578 [AAAT]n ?
DYS588 [TTGTA]n[TTGCA]n[TTGTA]n [TTGCA]n[TTGTA]n ?
DYS590 [TTTTG]n ?
DYS594 [TAAAA]n same motif
DYS607 [AAGG]n same motif
DYS612 [CCT]n[CTT]n[TCT]n[CCT]n[TCT]n ?
DYS614 [CTT]n[CCT]n[CTT]n[CCT]n[CTT]n [CCT]n[CTT]n[CCT]n[CTT]n[CCT]n [CTT]n[CCT]n[CTT]n[CCT]n[CTT]n ?
DYS617 [TTA]n ?
DYS626 [AAAG]n same motif
DYS632 [CATT]n same motif
DYS635/GATA C4 [TCTA]4[[TGTA]2[TCTA]2[TGTA]2 [TCTA]2[TGTA]0,2[TCTA]n same motif
DYS640 [AAAT]n ?
DYS641 [TAAA]n same motif
DYS643 [CTTTT]n same motif
DYS644 [TTTTA]n ?
DYS710 [AAAG]n same motif (different counting method)
DYS711 [CTT]n[CTTT]n[CTT]n ([CTC][CTT])compound[CTCCTTT]n[CTCCTA]n[CTT]n[CTC]n[CTT]n ?
DYS712 [AGAT]n[AGAC]n same motif
DYS713 [TCTT]ntc(TCTT)2tctgtcttttttctttct [CTTT]n(TCTT)5tcc(TTCT)2 (CTTTT)2ttta(TTTAT)4 ?
DYS714 [TTTCT]n[CTTCT]n[TTTCT]n [CTTCT]n[TTTCT]n same motif
DYS716 [CCATT]n ?
DYS717 [TGTAT]n[TATTG]n[TACTG]n [TATTG]n different motif
DYS724a/b [AAAG]nN5[AAGG]n ?
GATA A10 [TAGA]n same motif
GATA H4 [TAGA]n same motif
YCAII a/b [CA]n same motif

 *Variable regions in each motif are indicated in blue.

*Motifs used for allele designation in Version 3.5 are compared to NIST nomenclature, and marked as “same” or “different” accordingly. 

*Question mark (?) indicates that NIST data is currently unavailable.

What is “NIST nomenclature”?

NIST stands for “National Institute of Standards and Technology”.  NIST is a non-regulatory federal agency within the U.S. Department of Commerce with the mission to promote U.S. innovation and industrial competitiveness by advancing measurement science, standards, and technology in ways that enhance economic security and improve our quality of life. 

NIST has compiled and maintained a Short Tandem Repeat DNA Internet Database called STRBase, a valuable STR information resource.  Click here to view the NIST standards for the most commonly tested Y-chromosome STR markers

To facilitate Y-DNA STR data comparisons between different laboratories and research groups, many laboratories choose to follow the standard NIST nomenclature to avoid differences in allele values due to different nomenclatures (motif and counting method) between different laboratories.  Most motifs in the NIST nomenclature follow the ISFG (International Society of Forensic Genetics) guidelines.  Click here to view the latest ISFG recommendations on the Y-STR nomenclature.

Mar 18 2008

Understanding the different types of Y-DNA STR Markers

Published by Genebaseteam under Y-DNA STR Markers

The Y-DNA Test examines several different STR Marker Types:

A. Single-Copy Markers

Single-copy markers are DNA markers that occur only once in the human genome, resulting in one allele value for the marker.

B. Multi-Copy Markers

Markers DYS385, DYS459, DYS464 and YCAII are multi-copy Y-DNA STR markers that typically have two copies.

Marker DYS464 can occur in 4 to 7 copies in the human genome and the method for calculating genetic distance for DYS464 differs from the method used for other multi-copy markers.

Multi-copy markers are genetic/DNA markers that occur more than once (ie more than one copy) in the human genome, resulting in different allele values for each copy. For example, the markers DYS385, DYS459 and YCAII are typically present at two different locations on the Y-chromosome; therefore, they are also termed “duplicated markers”. For each multi-copy marker, the same primer pair binds to different locations on the Y-chromosome, thereby amplifying more than one region simultaneously, resulting in more than one allele value for that marker. The allele values for each copy are not reported in any specific order, as the exact order of copies cannot be determined, but typically, the smaller allele value is reported first, followed by the larger allele value.

C. Special Multi-Copy Marker DYS389

DYS389 is a special marker. Unlike other multi-copy markers, only one location is amplified. The forward primer for DYS389 binds at a specific location on the Y-chromosome, whereas the reverse primer binds at two different locations. Such amplification yields two PCR products:  the shorter DYS389I fragment and the longer DYS389II fragment.

Mar 18 2008

How is genetic distance calculated for Y-DNA STR markers

Published by Genebaseteam under Y-DNA STR Markers

Genetic distance is defined as the total difference in allele values of different genetic markers between two individuals. The smaller the value of the genetic distance, the closer two individuals are related, and the more recently they shared a common ancestor (TMRCA). The method used to determine genetic distance for four different Y-DNA STR marker types is explained below.

A. Calculation of genetic distance for single-copy markers

For single-copy markers, the calculation is straightforward. The genetic distance for each single copy marker between two individuals is the absolute value of the difference between the value of the markers:

The total genetic distance between two individuals is the sum of the genetic distances of all markers compared.

B. Calculation of genetic distance for multi-copy markers Markers DYS385, DYS459, DYS464 and YCAII are multi-copy Y-STR markers.

For most multi-copy markers, genetic distance can be calculated by adding the differences in allele values for each of the two copies.

C. Calculation of genetic distance for multi-copy marker DYS464 - using Infinite allele model

Assuming mutations at different copies of the same marker took place in a single generation, the Infinite allele method counts the total difference between all copies of the same marker as 1, despite the fact that more than one mutation exists.

The genetic distance for DYS464 is calculated using this method.

D. Calculation of genetic distance for DYS389i/ii

DYS389i is embedded in DYS389ii; therefore, the DYS389i values are included in DYS389ii values. Genetic distance can be determined by adding up two differences: differences in DYS389i values and differences in the second part of DYS389ii values, which are obtained by subtracting the DYS389ii values by DYS389i values.

Mar 18 2008

What is a Short Tandem Repeat (STR)?

Published by Genebaseteam under Y-DNA STR Markers

STR Overview

A short tandem repeat (STR) is a type of DNA polymorphism where short sequences of DNA are repeated. STRs are usually considered “junk DNA” because they are introns and do not code for protein. The number of times a DNA sequence is repeated for a given STR is variable between different individuals and thus, STRs are often useful for forensic or genealogical studies.

Y-DNA STRs

The STRs found in the Y-DNA are very useful for genealogical studies to examine male lineage. A male individual’s Y-DNA STR is unique to his paternal line and can be determined through Y-DNA STR Testing. That means that all males who are descendents from the same male lineage will have exactly the same or a very similar Y-DNA STR pattern.

Mar 18 2008

What is the difference between testing fewer STR markers versus testing more markers?

Published by Genebaseteam under Y-DNA STR Markers

A number of STR markers can be tested on the Y-DNA. The more markers that are tested, the more discriminating the matches when comparing to other individuals.

For example, comparison of 12 markers alone is generally not powerful enough to distinguish family lines and can give inconclusive results. The more markers that are available for comparison, the more discriminating the comparison becomes.

There are two major advantages for comparing more markers:

1.  To prevent false positives
2.  To obtain conclusive results

Scenario:

Mr. Jones has been studying his family’s ancestry for several years and has started a “Jones” family study based in Arizona. He is interested in confirming that his family line is linked to a “Jones” line in New York. Although there are rumours that the two lines are related, Mr. Jones does not have the paperwork to prove this link. Mr. Jones is also interested in finding out whether his line is linked to any other Jones lines worldwide.

Mr. Jones had previously chosen to test just 12 markers. After testing, he uses the 12 markers to search the DNA database and finds out that he is a perfect match to the Jones line in New York. However, he also finds that he has a perfect match to over 200 individuals in the database, and over half of them do not even share his surname. How is this possible? Does it mean that he is related to everyone who matches him at the 12 markers? No, this simply means that data from only 12 markers is not powerful enough to distinguish Mr. Jones from other family lines.

To clarify this, Mr. Jones decides to increase his markers to 20. He enters the results of his 20 markers into the database and this time narrows down the number of matches.  In fact, now, only 18 people match him perfectly at his 20 markers, including the Jones line in New York. Surprisingly, many of the individuals who used to match perfectly at 12 locations only match at 14 or less out of the 20 locations tested, confirming that there is no familial link with most of the 200 individuals identified in the 12 marker test (more than 3 mismatches indicates that two family lines are not closely related).

To further clarify the findings, Mr. Jones decides to upgrade to a 44 marker test. This time, he finds out that he is a perfect match at all 44 markers to only two lines, a Jones line in England, and a Jones line in the United States. After contacting the two lines and comparing paperwork and stories, Mr. Jones was able to confirm that his line was indeed definitely linked to both lines and he is now able to add both new lines to his family tree.

Surprisingly, Mr. Jones was also able to find out that only 43 out of the 44 markers matched with the Jones line in New York. This confirms that although the Jones line in New York is related to his line, they are likely more distantly related.

Mr. Jones also discovered that he had a close match to 4 other Jones lines (43 out of the 44 matched) and he is now pursuing the possibility that the 4 other lines are also distantly related to him (MRCA analysis dictates that 1 mutation occurs every 500 generations, and thus we would detect a mutation every 12 generations with the 44 marker test).

Mr. Jones is now trying to recruit more Jones males from throughout Europe to try to reconstruct and relink his family line.

Conclusion:

12 markers were not discriminating enough for Mr. Jones to pinpoint his family lines. After increasing to 20 markers, Mr. Jones was able to obtain more useful information and was able to eliminate false matches generated by the 12 markers. However, after increasing to 44 markers, Mr. Jones was able to pinpoint the people that he was looking for and was furthermore able to accurately answer his questions about his relationship to the Jones line in New York. Mr. Jones can continue to carry on his research, and as more and more people globally are tested and are added to the database, Mr. Jones will be able to reconstruct his family line in great detail and re-unite with Jones worldwide who are descendents of his family line.

Navigate Home | Family Tree | Friends | DNA Ancestry | Photos | Journals | Groups | My Profile
More Info Tutorials | News | Learning Center | Resources | About Genebase | Store | Applications | Terms
Get Help Help Center
© 2003 − 2008 Genebase Systems. All Rights Reserved.
Powered by WP.