Ethical statement
Ethical approval of this study was granted from the Human Experimentation Committee of the Research Institute for Health Sciences, Chiang Mai university, Thailand (Certificate of Ethical Clearance No. 31/2022). During the research, we protect the rights of participants and their identity, and we confirm that all experiments were performed in accordance with relevant guidelines and regulations based on the experimental protocol on human subjects under the Declaration of Helsinki. Written informed consent from all volunteers was obtained prior to the interview and sample collection.
A total of 95 unrelated subjects residing in five villages of Nan province, Thailand, were enrolled with written informed consent. Volunteers were healthy subjects who were over 20years old, of Khmuic-speaking ethnicity and had no ancestors that were known to be from other recognized ethnic groups for at least three generations. We collected personal data using form-based oral interviews for self-reported unrelated lineages, linguistics, and migration histories. Following the manufacturer's instructions, we collected buccal or saliva samples and extracted DNA using the Gentra Puregene Buccal Cell Kit (Qiagen, Germany).
Genotyping was carried out using the Affymetrix Axiom Genome-Wide Human Origins array10. Affymetrix Genotyping Console v4.2s primary screening produced a total of 93 samples that were genotyped for 622,834 loci on the hg19 version of the human reference genome coordinates (genotype call rate97%). We used PLINK version 1.90b5.224 to exclude loci and individuals with more than 5% missing data and also exclude mtDNA and sex chromosome from our analysis. We further excluded loci that did not pass the HardyWeinberg equilibrium test (P value<0.00005) or had more than 5% missing data, within any population. We used KING 2.325 to determine individual relatedness, and we removed one person from each pair of first degree kinship. After these quality control measures, there are 81 Khmuic-speaking people (Fig.1) with 612,614 loci overall.
We next used PLINK version 1.90b5.2 to merge our newly obtained genotyping results with a set of genome-wide SNP data8, which included populations from East/Southeast Asia, South Asia, African Mbuti, European French, and Southeast Asian ancient samples9,10,11,12,13. It should be noted that in this collection, allelic data from ancient samples was gathered using pseudo-haploid techniques, and samples with less than 15,000 informative loci were eliminated. After filtering the positions of SNPs that can be jointly analyzed within this dataset, we excluded SNPs that had more than 5% missing data or with a minor allele frequency (MAF) less than 3.3104 or were not in HardyWeinberg equilibrium with a significance level of P<0.00005. As a result, 353,505 positions in a dataset consisting of 979 individuals from 90 populations (Supplementary Table 1 and 2) were used for subsequent analysis.
In order to investigate the genetic structure and relationships of the analyzed sample, we used PLINK version 1.90b5.2 to perform pruning for linkage disequilibrium, excluding one variant from pairs with r2>0.4 within windows of 200 variants and a step size of 25 variants. A total of 959 individuals from the sample set, excluding the Mbuti and French populations, were incorporated. There were 149,384 SNPs positions available for this analysis. The Principal Component Analysis (PCA) was carried out using smartpca from EIGENSOFT with the "lsqproject" and "autoshrink" options.
To infer population structure, we employed 155,709 SNP positions derived from a sample set of 979 individuals, which encompassed both Asian samples and the outgroups represented by the Mbuti and French populations, for the ADMIXTURE analysis. The clustering tool ADMIXTURE version 1.3.014 was run from K=2 to K=10 with 100 replicates for each K and using random seeds with the -P option. For each K, the top 20 ADMIXTURE replicates with the highest likelihood for the major mode were displayed using PONG version 1.4.726. For these PCA and ADMIXTURE analyses, the ancient samples and highly drifted modern populations (Mlabri, Onge, Mamanwa, Khamu, and Lua) were projected.
To test admixture and excess ancestry sharing, we used admixr version 0.7.127 from ADMIXTOOLS version 5.110 to calculate the f3 and f4-statistics, with assessed through block jackknife resampling across the genome and using Mbuti as the outgroup. A total of 353,505 SNPs from 979 samples were used in these analyses. Additionalf4-statistics were computed when ancient samples were involved, using French as the outgroup to avoid deep attraction to Africans and only transversions (2,94751,452 SNPs depending on the quality of samples) to avoid potential noise from ancient DNA damage patterns28. We used pheatmap package in R version 3.6.0 to visualize the heatmap of f3 and f4 profiles.
To examine the haplotype sharing between different groups, we used SHAPEIT version 4.1.329 to phase the modern samples. We employed South Asian and East Asian populations as a reference panel (excluding the Kinh Vietnamese) and the recombination map from the 1000 Genomes Phase330 was also used. Our analysis specifically focused on modern population data, consisting of 359,539 SNPs. For the preparation of the reference panel, we extracted individuals of East and South Asian descent, as well as the overlapping sites with our data, for each chromosome from the 1000 Genomes Phase3 data using bcftools version 1.4. The phasing accuracy of SHAPEIT4 can be improved by increasing the number of conditioning neighbors in the Positional Burrows-Wheeler Transform (PBWT) on which haplotype estimation is based29. We conducted phasing with the option -pbwt-depth 8 for 8 conditioning neighbors, while keeping other parameters as default. Subsequently, we employed ChromoPainter version 231 on the phased dataset to initiate the investigation of haplotype sharing with sample sizes for each population were randomly down-sampled to 4 and 8. The former was used for 10 iterations of the EM (expectation maximization) process to estimate the switch rate and global mutation probability. The latter was employed for the chromosomal painting process with the estimated switch and global mutation rates. The output of this process was then used for downstream analyses. We then attempted to paint the chromosomes of each individual, with all the modern Asian samples serving as donors and recipients via the -a argument. The EM estimation yielded a switch rate of approximately 251.21 and a global mutation probability of approximately 0.00001, which were subsequently used as starting values for these parameters for all donors in the painting process. The heatmap results were generated using the pheatmap package in R.
To construct the admixture graph, our initial step involved selecting backbone populations from different language families in Southeast Asia. Specifically, we used f4-statistics to choose representative ethnic groups that speak Austronesian, Tai-Kadai, Austroasiatic, Hmong-Mien, and Sino-Tibetan languages, which included Atayal, Dai, Cambodian, Miao, and Naxi, respectively. We employed the African Mbuti and North Indian populations (Gujarati, Brahmin Tiwari, and Lodhi) who speak Indo-European languages as outgroups. Our focus was on constructing the admixture graph for the Austroasiatic language family in Thailand. Thus, we categorized these populations according to their linguistic branches; Katuic (Bru and Soa), Monic (Mon), Palaungic (Lawa_Eastern, Lawa_Western, Palaung, Blang), and Mlabri. Our interested Khmuic-speaking people were divided into the Khamu (consist of four Khamu populations) and Lua (consist of two Lua populations together with HtinMal and HtinPray).
For modeling the admixture graph, we used a dataset of 359,539 SNPs from modern populations as the input for ADMIXTOOLS 232. Initially, we computed pairwise f2 statistics between the groups using the extract_f2 function with specific parameters; maxmiss=0 (no missing SNPs to calculate), useallsnp: NO (no missing data to allow), and blg=0.05 (SNP block size set in 0.05 morgans). Then, we extracted allele frequency products from the computed f2 blocks using f2_from_precomp. Next, for each scenario, we searched for the best-fitting admixture graph by running ten independent runs of find_graphs. From the 100 independent runs, we selected the one with the lowest score (computed based on residuals between the expected and observed f-statistics given the data) using random_admixturegraph. To confirm the fitting graph, we tested the graph with the lowest score using qpgraph with parameters numstart=100, diag=0.0001, return_fstats=TRUE. This allowed us to check if the absolute value of the worst-fitting Z score was below 3. Starting with no migrations (numadmix=0), we gradually added migrations until we found a fitting graph, which we considered as the best-fitting graph for that particular scenario.
Go here to see the original:
Genetic diversity and ancestry of the Khmuic-speaking ethnic groups ... - Nature.com
- ENCODE: Encyclopedia Of DNA Elements - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- 07.05.2010 - The Human Genome [ Coast To Coast AM ] - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- NOVA scienceNOW : 51 - Public Genomes, Algae Fuel, Mystery of the Gakkel Ridge, Yoky Matsuoka - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Vincent T. - Genome (Club Remix) - [Preview] - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Comparing The Human And Chimpanzee Genomes - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Whole Genome Sequencing and Its Impact on Clinical Care - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Carlos Bustamante -- "Reconstructing the Great Human Diasporas from Genome Variation Data" - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- 3 Sad Surprises: The Human Genome Project - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- The RFW interviews Genome - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Science Bulletins: Scientists Peer Inside "Superbug" Genome - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genome : Live @ Smu's : June 3 2012 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Inoki Genome Federation - Genome 19 - 04 02 2012 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- THE HUMAN GENOME MUSIC PROJECT - CHROMOSOME 1 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genomic Medicine - Bruce Korf (2012) - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Human Genome's 'Blockbuster' Potential Undervalued in Bid GSK vs HGSI - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Announcing the Completion of the First Survey of the Entire Human Genome at the White House - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- DNA analysis Part I. Genomic Sequencing - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- The Genome Question: Moore vs. Jevons with Bud Mishra - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genome-Wide Association Studies - Karen Mohlke (2012) - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- New human genome research aids understanding of disease [Last Updated On: September 8th, 2012] [Originally Added On: September 8th, 2012]
- UNC Lineberger scientists lead definition of key lung cancer genome [Last Updated On: September 10th, 2012] [Originally Added On: September 10th, 2012]
- Illumina Announces Expedited Individual Genome Sequencing Service (IGS) [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Genome research given a boost with opening of bioscience facility [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Re-Imagining Our Genes: ENCODE Project Reveals Genome as an Information Processing System [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Illumina unveils upgraded genome sequence service [Last Updated On: September 12th, 2012] [Originally Added On: September 12th, 2012]
- US Personalized Cancer Genome Sequencing Market [Last Updated On: September 18th, 2012] [Originally Added On: September 18th, 2012]
- Yale maps “uncharted” genome regions [Last Updated On: September 18th, 2012] [Originally Added On: September 18th, 2012]
- Research and Markets: US Personalized Cancer Genome Sequencing Market [Last Updated On: September 19th, 2012] [Originally Added On: September 19th, 2012]
- 3Qs: New clues to unlocking the genome [Last Updated On: September 19th, 2012] [Originally Added On: September 19th, 2012]
- Oyster Genome Pries Open Mollusk Evolutionary Shell [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Bangladeshi scientist decodes genome of deadly fungus [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Oyster genome uncover the stress adaptation and complexity of shell formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- The oyster genome reveals stress adaptation and complexity of shell formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Diseases of aging map to a few 'hotspots' on the human genome [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- GnuBIO Awarded $4.5 Million in Funding from the National Human Genome Research Institute to Develop Lower Cost Genome ... [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Oyster genome mystery unravelled [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Devangshu Datta: What's in a genome [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Pacific Oyster Genome Shows Stress Adaptation And Complexity Of Shell Formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- UNC Lineberger scientists lead cancer genome analysis of breast cancer [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Encoding the human genome [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Cancer genome analysis of breast cancer: Team identifies genetic causes and similarity to ovarian cancer [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Fungus genome map paves way for 'Snow White' jute variety [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- New online, open access journal focuses on microbial genome announcements [Last Updated On: September 25th, 2012] [Originally Added On: September 25th, 2012]
- By Simply Sharing, Doctors Could Unlock the Genome's Potential [Last Updated On: September 25th, 2012] [Originally Added On: September 25th, 2012]
- Forget the Cloud—Knome Offers Genome Analysis in a Box [Last Updated On: September 28th, 2012] [Originally Added On: September 28th, 2012]
- BGI@CHOP Joint Genome Center to Offer Clinical Next-Generation Sequencing Services [Last Updated On: September 28th, 2012] [Originally Added On: September 28th, 2012]
- Holy Bat Virus! Genome Hints At Origin Of SARS-Like Virus [Last Updated On: September 29th, 2012] [Originally Added On: September 29th, 2012]
- Community Fundraising Effort Helps Researchers Sequence Parrot Genome [Last Updated On: September 29th, 2012] [Originally Added On: September 29th, 2012]
- UMass Med professors are sleuths of the genome [Last Updated On: September 30th, 2012] [Originally Added On: September 30th, 2012]
- Knome Introduces the knoSYS™100; First Plug-and-Play Human Genome Interpretation System [Last Updated On: September 30th, 2012] [Originally Added On: September 30th, 2012]
- First large scale trial of whole-genome cancer testing for clinical decision-making reported [Last Updated On: October 1st, 2012] [Originally Added On: October 1st, 2012]
- Should You Get Your Genome Mapped? [Last Updated On: October 1st, 2012] [Originally Added On: October 1st, 2012]
- Surprising differences between apples and pears [Last Updated On: October 2nd, 2012] [Originally Added On: October 2nd, 2012]
- 50-Hour Whole Genome Sequencing Provides Rapid Diagnosis for Children With Genetic Disorders [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- A map of rice genome variation reveals the origin of cultivated rice [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome analysis promises hope for breast cancer patients [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome Alberta Welcomes Alberta Minister of Enterprise and Advanced Education, Stephen Khan and Federal Minister of ... [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Fifty-hour whole genome sequencing provides rapid diagnosis for children with genetic disorders [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Will Low-Cost Genome Sequencing Open 'Pandora's Box'? [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome testing could help individualize treatments [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Would you get your genome tested? [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- The Genome — a Pandora's Box? [Last Updated On: October 4th, 2012] [Originally Added On: October 4th, 2012]
- Fast genome test could help sick newborns [Last Updated On: October 4th, 2012] [Originally Added On: October 4th, 2012]
- In-Depth Genome Analysis Moves Toward The Hospital Bed [Last Updated On: October 5th, 2012] [Originally Added On: October 5th, 2012]
- Your Verdict On Getting A Genome Test? Bring It On [Last Updated On: October 6th, 2012] [Originally Added On: October 6th, 2012]
- Genome-wide study identifies 8 new susceptibility loci for atopic dermatitis [Last Updated On: October 7th, 2012] [Originally Added On: October 7th, 2012]
- Genome-wide study identifies eight new susceptibility loci for atopic dermatitis [Last Updated On: October 7th, 2012] [Originally Added On: October 7th, 2012]
- Genome interpreter vies for place in clinical market [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- The $1,000 Genome: A Bait and Switch? [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- Mount Sinai School of Medicine Offers First-Ever Course with Whole Genome Sequencing [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- First whole genome sequencing of multiple pancreatic cancer patients has been outlined [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- Cheap genome sequences demand new rules on privacy [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- UConn Gets Grant For Genome Research [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- Inconsistent Genome Privacy Laws Need Toughening, Panel Says [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- US panel calls for stronger privacy for genome data [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- Genome Canada Board Appoints New Chair [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- The $1,000 Genome Is Almost Here- Are We Ready? [Last Updated On: October 15th, 2012] [Originally Added On: October 15th, 2012]
- Global genome effort seeks genetic roots of disease [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]
- Massive encyclopedia helps explain how the human genome works [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]
- Genome evolution and carbon dioxide dynamics [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]