Aboriginal means relating to a group of people native to a geographic region. These are the original inhabitants of a region.
Adenine is one of the four bases that make up our DNA. It is abbreviated “A.”
The other bases are thymine (T), guanine (G), and cytosine (C). Adenine always pairs with thymine.
An administrator (a.k.a. Group Administrator) is someone who is in charge of Group Projects. The Group Projects at FamilyTreeDNA are run by unpaid volunteer administrators.
Admixture refers to ancestry from more than one recent population group. Many people today have ancestry from more than one population and/or location.
American Indian is the popular term for the Indigenous Peoples of the Americas. These are the descendants of the early settlers in North and South America.
An ancestor is someone from whom you descend. For example, your grandparents are your ancestors.
In genetic genealogy, the ancestral haplotype is the set of marker values of your ancestor.
The ancestral signature is the oldest known or suspected haplotype for a lineage.
The ancestral state of an allele is the assumed initial condition (value) of the allele and is often represented by the sequence reference.
See also: Derived State
Anthrogenealogy is the study of human origins, recent and distant, using both DNA testing and genealogy.
Anthropology is the study of human origins and culture.
Ashkenazi is the branch of the Jewish population that settled in Germany and then Eastern Europe during the Jewish Diaspora.
A back mutation is when a marker value changes back to its original value.
A base is a unit or building block of DNA. Adenine (A), cytosine (C), guanine, (G), and thymine (T) are the four primary bases in DNA. The order of bases is the sequence of DNA.
In genetics, nucleotides are called bases. A base pair (bp) is two complementary nucleotides on opposite strands of DNA. Base pairs are measured using metric units.
- 1 base pair = 1 base pair (bp)
- 1,000 base pairs = 1 kilo-base (kb)
- 1,000,000 base pairs = 1 mega base (Mb)
Biogeographical analysis it the study of how an organism varies genetically with respect to its geographic location.
In genetic genealogy, we are all part of the human race. For us, biogeographical analysis refers to analyzing the differences that have developed between population groups. These are small changes that have happened since our common ancestors migrated from Africa.
A buccal cell is a type of cell found in cheek tissue inside the mouth.
A catalyst is a substance which starts or speeds up a chemical reaction without being affected by that reaction.
A centiMorgan (cM) is a measurement of how likely a segment of DNA is to recombine from one generation to the next. A single centiMorgan is considered equivalent to a 1% (1/100) chance that a segment of DNA will crossover or recombine within one generation.
For humans, one million base pairs (bp) average about one centiMorgan. However, the rate of recombination is highly variable.
A centromere is one of the parts of each chromosome. It is a dense area that joins together the two chromatids (arms) of each chromosome.
A chromatid is one of two strands of a chromosome.
A chromosome is a structure found in the nucleus of a cell that contains genetic material. Humans have 23 pairs of chromosomes: 22 pairs of autosomes and one pair of sex chromosomes.
A clade is a group of related individuals.
A coding region is DNA that contains genes. In genetic genealogy, this most often refers to the part of the mitochondrial genome that contains genes.
The Cohanim Modal Haplotype (CMH) is the Y-chromosome (paternal) profile most frequently found in men with an oral tradition of Cohen ancestry. It is a Y-chromosome DNA STR (Short Tandem Repeat) haplotype.
Cohen is the Hebrew word for priest, which refers to a direct male descendant of Aaron, the brother of Moses. The plural is Cohanim.
The CODIS system uses marker locations in the autosomal DNA. In the United States, the FBI maintains a CODIS test result database to identify people and solve crimes.
Complementary sequences are opposing strands of DNA. They bond together to form the double helix. The bases always complement one another. Adenine and thymine pair together. Cytosine and guanine pair together.
Convergence is the process of two genetically distant haplotypes changing over time to resemble one another.
Cytosine is the “C” of the four bases that make up DNA. Cytosine always pairs with guanine.
The other bases are adenine (A), guanine (G), and thymine (T).
A deletion is when one or more of the letters (nucleotides) of your genetic code is deleted.
DNA, deoxyribonucleic acid, is the genetic code that makes each of us a unique individual. Humans inherit about one half of their genetic code from each of their parents. Our genetic code then holds the story of our heritage that has been passed down through the generations.
The derived state of an allele is the changed (mutated) condition of the allele that differs from the ancestral state.
See also: Ancestral State
A descendant is someone who descends from a specific ancestor. For example, your children and grandchildren are your descendants.
A diaspora is the permanent displacement of a population from one location to a different location or locations.
DNA amplification is the production of many DNA copies from one or a few copies or fragments.
DNA replication is the process by which the DNA double helix makes a copy of itself. It uses the old DNA as a template for the synthesis of new DNA strands. In humans, replication occurs in the cell nucleus.
A DNA segment is any continuous run or length of DNA. It is described by the place where it starts and the place where it stops.
DNA sequencing is the process of determining the exact order of the nucleotide bases in a segment of DNA.
A double helix is the twisted shape DNA forms when its two strands bond together. It looks like a twisting or rotating ladder.
An endogamous population is one where the members usually only marry within the population group. The bases for endogamy may be geography, ethnic identity, social class, or religion. Long periods of intermarriage have left many endogamous populations with lower than average levels of genetic diversity. Examples of historically endogamous populations are the Amish, the Basque, and the various sub-populations of the Jewish Diaspora.
A protein that facilitates a specific chemical reaction by working as a catalyst.
An exact match is when two people have exactly the same results for all markers or regions compared.
Exogamy is marriage outside of a cultural or population group.
A factoid is an unverified piece of information. In genetic genealogy, it refers to genetic findings that are over reported in popular culture. These factoids may have no validity.
FamilyTreeDNA Time Predictor (FTDNATiP™) is a program used to calculate estimates of Time to the Most Recent Common Ancestor (TMRCA) for paternal lineages. It is the world’s first calculator that incorporates mutation rates specific to each marker. This increases the power and precision of estimates.
A Gene is a region of DNA which codes for a protein or part of a protein.
The genealogical time frame is the most recent one to fifteen generations. Recent genealogical times are the last one to five generations.
A Genealogical Data Communication (GEDCOM) file is a special file format that was developed to provide a standard for encoding genealogical data. It is not used by most family tree software packages but most can import and export to GEDCOM format. Because of this, it is today used by many genealogists to exchange pedigree data files.
Genealogy is the study of family history.
A generation is the number of years between the birth of the parents and the birth of their children. Different studies use different numbers of years per generation. At FamilyTreeDNA, we use 25 years. However, for Time to the Most Common Ancestor (TMCA) calculations, it is the number of generations that is important.
A genetic cousin is someone that meets the criteria to be a genetic match in genetic genealogical testing that may or may not be a known cousin.
There are two meanings for Genetic Distance:
- Genetic Distance is the number of differences, or mutations, between two
sets of results. A genetic distance of zero means there are no differences in
the results being compared against one another, i.e., an exact match. This is
the meaning when comparing Y-chromosome DNA or mitochondrial DNA.
- For autosomal DNA comparisons, genetic distance may refer to the size of a
DNA segment. The genetic distance is then the length of the segment in
Genetic drift is when a subset of a population moves to a different location and becomes genetically less like the main population. This happens over many generations.
Genetic genealogy is the use of your DNA to solve genealogy puzzles.
Genetics is the study of genes and heredity; the study of DNA.
A genome is the entire complement of an organism’s genetic material. This may refer to the DNA of a gamete, organelles (mitochondria and chloroplasts), organism, or species.
The human nuclear genome is composed of 46 chromosomes (23 pairs). They contain a total of 3 billion base pairs.
The human mitochondrial genome is composed of a single circular DNA sequence that contains 16569 base pairs.
The genetic makeup of an individual organism.
Glacial Maximum is the scientific term for the peak of an ice age.
The Group Administration Page (GAP) is the user interface that FamilyTreeDNA Project Administrators use to manage Group Projects. The term GAP is also often used for Project Administrators.
Guanine is the “G” of the four bases that make up DNA. Guanine always pairs with cytosine.
The other bases are adenine (A), cytosine (C), and thymine (T).
A haplogroup is a major branch on either the maternal or paternal tree of humankind. Haplogroups are associated with early human migrations. Today these can be associated with a geographic region or regions.
A haplotype is the set of values for a set of DNA values. For example, the results of the Y-DNA12 test for one person is their haplotype.
Two individuals that match exactly on all markers have the same haplotype.
Heredity is the transmission of genetic material from parents to offspring.
Heterozygous means that the two genetic code values (alleles) at a point in the genetic code are different.
Homozygous means that the two genetic code values (alleles) at a point in the genetic code are identical.
The Human Genome Organization (HUGO) is the scientific entity to which, among other things, researchers submit new short tandem repeat (STR) markers for number assignment.
A hypervariable region (HVR) is a part of the mitochondrial genome. There are two human hypervariable regions: HVR1 and HVR2. They do not contain genes. Therefore, they have a faster change (mutation) rate than the coding part of the mitochondrial genome.
IBD stands for identical by descent. This means the DNA matches because it comes from a common ancestor. IBD can refer to a single mutation or to a segment of DNA. If a mutation or segment of DNA is IBD among a group of people, it comes from a common ancestor.
The Family Finder relationship predictions require a minimum number of results in a row to be identical in order to identify that the segment is likely to be IBD.
IBS stands for identical by state, meaning the DNA matches by coincidence. When two individuals share numerous individual results without being related, those results are IBS.
In genetics, inbreed refers to someone whose parents are related. It most often refers to cases where the relationship is within five generations.
An indel is a type of mutation where genetic code is lost or gained. These are insertions and deletions.
An insertion is when one or more of the letters (nucleotides) of the genetic code is added.
A descendant of the Hebrew tribes.
Junk DNA is a popular term for DNA that does not contain genes. This is non-coding DNA. Most of the genome consists of non-coding DNA. Because it does not code for specific function, it was long thought to be “junk.” However, scientists have found that in addition to containing markers that are helpful for genetic genealogy, parts of these non-coding regions have regulatory and other functions.
A descendant of the Hebrew tribe of Levi. There are strict historic guidelines for who is considered a Levite.
A lineage is all descendants of a specific ancestor.
A locus is a specific location in your genetic code. In a genetic map of our DNA, the locus tells us where to find any base. Each locus is named sequentially so that on chromosome 15 locus 26039212 comes after locus 26039211. The plural of locus is loci.
The stage in the reproductive process in which sperm and egg cells are formed. During meiosis, the autosomal chromosomes recombine and mutations may occur.
A microarray or SNP chip is a high density DNA test that is able to test many thousands of single nucleotide polymorphisms (SNPs) at once. The microarray chip is able to capture much of the diversity in someone’s genetic code by sampling known polymorphic loci.
Mitochondria are specialized subunits (organelle) within cells.
In humans, mitochondria are responsible for cell respiration and for producing energy. They evolve into their current state from separate organisms that form a mutually beneficial (symbiotic) relationship with the larger cell. Because they were once independent, they have their own mitochondrial DNA (mtDNA) genome.
This genome is passed from human mother to child.
The genetic material found in mitochondria. It is passed down from females to both sons and daughters, but sons do not pass down their mother’s mtDNA to their children.
Mizrahi is the branch of the Jewish population that settled in Middle Eastern, North Africa, Caucasus countries. This may include Sephardi Jews who move to these places.
The most common result for each marker tested in a group of results.
In genetic genealogy, the Most Recent Common Ancestor (MRCA) is the ancestor shared most recently between two individuals.
A heritable change that occurs in genetic material. It may lead to a different number of repeats of a certain sequence or a change in one of the bases in a sequence.
The frequency with which random mutations occur.
myFTDNA is the user interface that FamilyTreeDNA customers use to view their test results and matches.
Non-coding DNA is DNA that does not contain genes. It may have other functions.
The non-recombining Y (NRY) is the part of the Y chromosome that does not recombine with the X chromosome.
Nuclear DNA is the genetic code that is found inside of the cell’s nucleus. Our autosomal and sex chromosomes are nuclear DNA.
Nucleic acids are the basic components of our genetic code. DNA is made up of four types of nucleic acids: adenine (A), cytosine (C), guanine (G), and thymine (T).
Nucleotides are structural components of our genetic code. Each nucleotide is composed of one base plus a sugar molecule and a phosphate molecule. The bases are adenine, thymine, guanine, and cytosine, normally represented as A, T, G, and C, respectively.
The membrane-bound organelle containing the chromosomes.
An organelle is a part of a cell that performs a specialized function. Examples are the nucleus and the mitochondria.
Outbreed is when an individual’s parents’ common ancestry was more than ten generations in the past.
The P Arm is the shorter of the two sides (short arm) of a chromosome.
A palindrome is something that reads the same way in either direction. In genetic genealogy, it is sections of DNA that read the same way. It is most significant for Y-chromosome DNA because palindromes may be copied over each other.
A parallel mutation is when the same genetic change happens in completely unrelated lineages.
For STRs, a plot which shows the length of a fragment of DNA. This allows its allele value to be measured.
Phylogenetics is the study of how genetics can be used to show how people are related.
A phylogenetic tree is the reconstruction through genetics of a lineage.
The enzyme that starts the process of making nucleic acids or assembling RNA or DNA.
A technique allowing the production of multiple copies of extremely small amounts of DNA fragments using DNA polymerase and specific primers.
A Polymorphism is a change in genetic code (mutation) that has reached a greater than 1% frequency in a local or global population. In genetic genealogy, we most often use it to describe backbone branch defining mutations. These are related to backbone haplogroups.
A population is a group of people who inhabit a geographic region or share a common origin.
A population bottleneck is when a population is greatly reduced in size.
A short DNA sequence used in the polymerase chain reaction to initiate DNA synthesis at a particular location.
The main building block of our cells. Each one has a specific function.
Principal component analysis is a mathematical method that attempts to separate an admixed data set (here a genetic profile) into one or more contributing groups. It was invented by Karl Pearson in 1901 and is sometimes called the Karhunen-Loève transform or proper orthogonal decomposition
Recombinational Loss of Heterozygosity (recLOH) is a process by which one copy of genetic code is copied over others. The result is identical values. In genetic genealogy, this is most significant for the Y chromosome. Palindromic STR (short tandem repeat) markers may be copied over each other.
For example, DYS385 may have a,b values of 12,19 for a father. His son may have values of 12,12. This is a single recLOH event.
Recombination is the mixing of the DNA on each chromosome that you receive from your mother and father. Different chromosomes and different parts of each chromosome are more or less likely to recombine in a single generation.
A protein that recognizes a certain sequence of DNA and cuts the DNA at that site.
Sephardic is the branch of the Jewish population that settled in Spain during the Jewish Diaspora.
The X or Y chromosome. Normally males have one X and one Y and females have two Xs.
A short DNA motif (pattern) repeated in tandem. ATGC repeated eleven times would give the marker a value or allele of 11.
A single nucleotide polymorphism (SNP) is a change in your DNA code at a specific point.
A sister clade is one of two haplogroups or subclades that are at the same level on a phylogenetic tree. For Y-chromosome research, this is sometimes a brother clade.
For example, on the maternal tree, H6a and H6b are sister clades.
A subclade is a subgrouping in the haplogroups of the human genetic trees. This may be either the Y-chromosome tree or the mitochondrial tree. Subclades are more specific to a location or population group than the major branches (haplogroups).
A last name or family name traditionally in many Western European countries passed down from a father to his children.
A telomere is the end of a DNA chromosome. Each of our autosomal and sex chromosomes has two telomeres.
The “T” of the four bases that make up DNA. The other bases are adenine (A), cytosine (C), and guanine (G). Thymine always pairs with adenine.
The amount of time or number of generations since individuals have shared a common ancestor. Since mutations occur at random, the estimate of the TMRCA is not an exact number (i.e., seven generations) but rather a probability distribution. As more information is compared, the TMRCA estimate becomes more refined.
A transition is a type of change in the genetic code (mutation). Examples are A < -> G and C < -> T.
The passage of genetic material from one generation to the next.
A transversion is a type of change in the genetic code (mutation). Examples are A < -> C and G < -> T.
Unique Event Polymorphisms are bascially rare mutations that occur so infrequently that they are considered to all come from a single, common ancestor.
The most common Y-DNA haplotypes found in Europe’s most common Y-DNA haplogroup, R-M269.
One of the two sex chromosomes, X and Y. Males receive a single X chromosome from their mother, while females receive an X chromosome from both their mother and their father. X is the sex chromosome that is present in both sexes, singly in males and doubly in females.
One of the two sex chromosomes, X and Y. The Y chromosome passes down from father to son. Females do not receive it. As the Y chromosome is passed on through the paternal line, it is valuable for surname based genealogy studies.
A graphic representation of the Y-DNA haplogroups according to the Y Chromosome Consortium (YCC) classification. Haplogroup names and major clades are labeled and mutation names are given along the branches of the trees.
The current version of the tree is the YCC2010.
A no call occurs when a particular single nucleotide polymorphism (SNP) being analyzed has insufficient data to be confidently given a genotype value.
For FamilyTreeDNA, a novel variant is a difference on a person’s Big Y test from the reference sequence that has not been before seen. Novel variants may or may not be unique to an individual and will be listed separately from the Known SNPs on Big Y until sufficient data is available to name and add the position of the SNP to the Known SNPs list.
The Longest Block in the autosomal Family Finder test refers to the the longest continuous segment of autosomal DNA that is shared between two individuals.
The Q Arm is the longer of the two sides (long arm) of a chromosome.
Your earliest known ancestor is the furthest person who you have documented on a specific genealogical line. In genetic genealogy, it usually refers to someone on a direct maternal line (the mother, her mother, her mother’s mother, etc.) or on a direct paternal line (the father, his father, his father’s father, etc.).
The revised Cambridge Reference Sequence (rCRS) is the revised sequence based on the first mtDNA sequence completed (Cambridge Reference Sequence).
- HVR1 – 16001 to 16569
- HVR2 – 00001 to 00574
- Coding Region – 00575 to 16000