Previous analyses have indicated that environmental triggers associated with weather conditions, specifically air moisture and temperature in the region of the saiga antelope calving during the day period running up to the event, were critical to the proliferation of latent bacteria and were comparable to conditions accompanying historically similar die-offs in the same areas.

We investigated whether additional viral or bacterial pathogens could be detected in samples from affected animals using 3 different high-throughput sequencing approaches. We did not identify pathogens associated with commensal bacterial opportunisms in blood, kidney, or lung samples and thus concluded that P. The saiga antelope Saiga tatarica tatarica and S.

Each year during the month of May, Saiga antelopes gather in Kazakhstan for calving. Mass die-offs in their populations have been reported previously and were attributed to viral and bacterial etiologies, including pasteurellosis 2. However, the diagnosis in most of these events has been unreliable because of insufficient fresh sampling and diagnostic work 2. During a large outbreak in , extensive diagnostics and environmental studies were undertaken, subject to restricting factors such as remoteness and limited cold chain resources.

Annual disease monitoring in saiga antelopes had been established after die-offs occurred in western Kazakhstan in , and an international multidisciplinary research team was on the ground at the time of the die-off, performing routine surveillance 3, 4. These subgroups of saiga antelopes were spread discretely across a landscape of several hundreds of thousands of square kilometers.

The number of dead animals constituted more than two thirds of the global population of saiga antelope at the time. Laboratory results on the microbiologic, pathologic, and environmental conditions at the time of the outbreak suggested hemorrhagic septicemia caused by Pasteurella multocida serotype B and triggered by environmental conditions 3, 6. However, whether a second unknown infectious agent had predisposed the animals to infection with P.

Given the opportunistic nature of Pasteurella, the objective of our study was to determine whether any additional, previously unidentified pathogens that might have contributed to the die-off were present in samples taken from 10 animals. The first dead animals were detected in the Amangeldy District (Kostanay region) of Kazakhstan on May 10, , and additional die-offs were recorded in unconnected discrete locations in the Aktobe and Akmola regions 3.

A primary diagnosis of hemorrhagic septicemia as the cause of death was proposed at the sites on the basis of clinical signs and gross pathology. We collected whole blood spots on FTA cards from 8 freshly dead female animals (Table 1) within a 2-km radius on the last 2 days of the operation and sent them to international reference laboratories for high-throughput sequencing (HTS).

FTA cards were used as backup given the limited resources available and difficulties in maintaining cold chain and in transportation of fresh samples to local laboratories. Lung and kidney tissue from 2 dead saiga antelopes (lung tissue from animal X and kidney tissue from animal Y) from the Turgai River region were also processed for 16S metagenomics sequencing in the city of Almaty, Kazakhstan. Although these samples were from a region km from the site where the FTA card samples were taken, they were considered part of the same saiga antelope population.

Given the uniformity of the clinical syndrome and consistency of the pathogenesis, the sample of cases selected was small relative to the scale of the die-off, but each case was evaluated in considerable depth and considered representative of the affected population on the basis of the consistent pathology and disease characteristics observed in all the affected animals 3.

Two of the 8 samples were processed by both laboratories. Lung and kidney tissue from 2 dead saiga antelopes (Table 2) were tested for 16S bacterial diversity by using a 16S metagenomic sequencing protocol developed by the Institute of Microbiology and Virology in Almaty (Figure 2).

Figure. Geographic distribution of saiga antelope die-off events, Kazakhstan. Red and orange areas indicate known outbreak locations of the 3 saiga populations. Inset shows area in relation to the rest of Kazakhstan and neighboring countries of central Asia.

Figure. Outline of the process of sampling and high-throughput sequencing protocols performed at 3 research institutes in an investigation of a mass die-off of saiga antelopes, Kazakhstan.

We analyzed reads from each of the parallel investigations by using established bioinformatics pipelines to identify microbial agents present within each sample. All raw datasets, de novo assemblies, and 16S sequencing metagenome datasets have been submitted to the European Nucleotide Archive and GenBank accession nos.

The classification of sequenced reads into different taxonomic groups was conducted by using 2 approaches: the first was a k-mer-based approach that assigned each read independently (Table 3; Appendix Table 1), and the second was a de novo approach that first assembled reads into contigs and then assigned contigs (Table 4; Appendix Tables 2-4).
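The idea behind the first, k-mer-based approach can be illustrated with a minimal sketch. This is not the pipeline used in the study; the reference sequences, taxon names, and the value of `K` below are hypothetical placeholders, and the simple majority vote is an assumption chosen only to show how each read can be assigned independently of all others.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(references, k):
    """Map each k-mer to the set of taxa whose reference contains it."""
    index = {}
    for taxon, seq in references.items():
        for kmer in kmers(seq, k):
            index.setdefault(kmer, set()).add(taxon)
    return index


def classify_read(read, index, k):
    """Assign one read to the taxon sharing the most k-mers with it.

    Returns None when no k-mer matches or when the top two taxa tie.
    """
    votes = Counter()
    for kmer in kmers(read, k):
        for taxon in index.get(kmer, ()):
            votes[taxon] += 1
    if not votes:
        return None
    ranked = votes.most_common(2)
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None  # ambiguous: leave the read unclassified
    return ranked[0][0]


# Hypothetical toy references; real classifiers index whole genome databases.
REFERENCES = {
    "taxon_A": "ATGGCGTACGTTAGCCGATAGGCTTAACG",
    "taxon_B": "TTGACCGGTATCCGGAATTCCGGTTAAGC",
}
K = 8
INDEX = build_index(REFERENCES, K)

reads = ["GCGTACGTTAGCCGA", "CCGGTATCCGGAATT", "AAAAAAAAAAAAAAA"]
labels = [classify_read(read, INDEX, K) for read in reads]
```

Because every read is scored on its own, this style of classification needs no assembly step, which is the main contrast with the de novo approach, where reads are first merged into contigs and only the contigs are assigned.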

Neither of these 2 approaches conclusively identified a single virus as a causative agent in all samples. In all 6 samples tested, The specificity of this finding was increased for the P. The de novo analysis approach also did not identify any homologies with unexpected viral genomes. Several bacteria were identified by both k-mer and de novo analysis protocols, including P.

We subjected these contigs to an extended analysis in which they were first aligned to BLAST databases with tblastx to find similarities at the protein level. That analysis generated matches for 35 contigs; the distribution of the matches in terms of species closely mirrors the one found by nucleotide BLAST (Appendix Tables 2-4). This approach returned hits for 87 contigs (Appendix Tables 2-4), of which most appeared to be homologs of bacterial proteins.
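Protein-level searches such as tblastx compare sequences after translating nucleotides in all six reading frames (three offsets on each strand). The following sketch shows only that translation step, under the assumption of the standard genetic code; it is an illustration and not part of the BLAST software or of the study's pipeline, and the function names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become X."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Return translations for frames +1..+3 and -1..-3, keyed by frame name."""
    frames = {}
    for strand, strand_seq in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(strand_seq[offset:])
    return frames


frames = six_frame_translations("ATGGCCTAA")
```

Comparing translated frames rather than raw nucleotides lets distantly related sequences match, because synonymous codon changes do not alter the encoded protein.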

No further pathogens could be conclusively identified using this analysis. Further analysis of the assembled sequences attributed to P. By using the RIEMS analysis pipeline, we performed taxonomic analysis of the sequencing reads obtained from libraries generated from RNA extracted from 4 blood spots from FTA cards that had been transcribed into cDNA using random hexamer priming followed by shotgun library preparation.

Of the samples tested at FLI, samples 2 and 5 were also tested at Pirbright. Overall, these analyses classified The remainder mainly represented host sequences 0. With a few exceptions for phage reads, no reads were classified as being of viral origin, which was concordant with findings of the Pirbright dataset.

In all samples, the proportion of reads remaining unclassified after analysis of the nucleic acid sequences was low 0. Therefore, the information content of the unclassified portion of the datasets was too low to provide additional information even by additional analyses on the basis of the amino acid sequences deduced from these reads. To conduct a detailed analysis of the numerous P.

We then performed blastx 7 analyses of the resulting contigs for a basic function prediction of the expressed genes. Besides detecting genes encoding proteins of gene expression, general metabolism, and cell division, these analyses detected several proteins associated with pathogenicity. For example, proteins facilitating active iron uptake iron ABC transporter permease [GenBank accession no. These analyses also revealed expression of genes encoding stress- and starvation-induced proteins stringent starvation protein A homologue [accession no.

Among the variable regions of the 16S gene, V3 is a highly variable region that can distinguish bacteria to the genus level. V4 is also efficient but less so than V3 8. The output of the workflow classified the reads at the primary taxonomic levels (kingdom, phylum, class, order, family, genus, and species). Sequencing statistics revealed the number of total reads to be 63, for lung tissue and 15, for kidney tissue.
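To make the rank-level output concrete, the sketch below tallies classified reads at a chosen taxonomic rank and reports each taxon as a percentage of all reads. The lineage tuples and taxon names are hypothetical placeholders rather than data from this study, and treating a lineage that stops above the requested rank as "unclassified" is an assumption made for illustration.

```python
from collections import Counter

RANKS = ("kingdom", "phylum", "class", "order", "family", "genus", "species")


def summarize_by_rank(lineages, rank):
    """Return {taxon: percentage of reads} at the requested rank.

    Each lineage is a tuple ordered as in RANKS; a lineage that stops
    above the requested rank is counted as "unclassified".
    """
    idx = RANKS.index(rank)
    counts = Counter(
        lineage[idx] if len(lineage) > idx and lineage[idx] else "unclassified"
        for lineage in lineages
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {taxon: 100.0 * n / total for taxon, n in counts.items()}


# Hypothetical classifications: three reads resolved to species level,
# one read resolved only to family level.
full = ("Bacteria", "Phylum_X", "Class_X", "Order_X", "Family_X", "Genus_A", "Genus_A sp1")
partial = ("Bacteria", "Phylum_X", "Class_X", "Order_X", "Family_X")
toy_lineages = [full, full, full, partial]

genus_summary = summarize_by_rank(toy_lineages, "genus")
kingdom_summary = summarize_by_rank(toy_lineages, "kingdom")
```

Reporting percentages rather than raw counts makes samples with different sequencing depths, such as the lung and kidney libraries, easier to compare.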

The number of reads passing quality filtering was 58, for lung tissue and 14, for kidney tissue. The percentage of reads passing quality filtering was Of all reads generated, Other species were Pasteurellaceae

Saiga antelopes are a critically endangered species 1, and the population is increasingly fragmented and vulnerable to stochastic events such as disease epidemics. The saiga antelopes undertake large-scale seasonal migrations between their summer and winter ranges because of the extreme variation in climate conditions and the need for pastures offering sufficient forage.

The calving sites are highly variable from year to year and depend on plant phenology, environmental factors, and anthropogenic effects 9. A few incidences of infectious disease, including foot-and-mouth disease, have been confirmed 12, but most events were attributable to pasteurellosis; M. However, diagnoses lack comprehensive clinical, pathological, epidemiologic, and environmental investigation and remain tentative in all cases outside the event.

Diagnosis of wildlife deaths is constrained by the fact that these populations are neither managed nor always monitored regularly, meaning die-offs occur frequently and investigators often do not have access to fresh carcasses. In the saiga antelope event, a monitoring team was in place in 2 of the 15 die-off locations and was equipped for general diagnostic work.

This situation was unusual and provided a unique opportunity, but the unpredictability of such an event happening limited the extent of the outbreak investigation. Sampling was necessarily strategic, and because all of the animals in the population were affected by the same syndrome and died, the sample size did not need to be large or statistically representative.

Each case would have an equal chance of providing the result, and failure to diagnose would be more likely a product of insufficient material per case or loss of viability of organisms because of cold chain and storage issues. Nevertheless, the findings obtained from this work are representative of the population for a few reasons.

First, the clinical syndrome was uniform in both the adult and calf populations. We observed no statistically significant variation in the temporal progression once symptoms were noted, and clinical signs and gross pathology were highly consistent. In addition, the rapidity of the syndrome precluded large numbers of cases being investigated by our relatively small team because necropsy and sampling for each case took several hours to complete.

Previous studies had demonstrated the absence of other potential causative agents by diagnostic PCR, including bacteria e. Furthermore, these studies used capsular typing with specific primers to show that strains of P. Our 16S analysis also showed that the P. Each of these workflows demonstrated the potential for different experimental challenges in obtaining metagenomic datasets e.

The high sensitivity of such methods to detect small amounts of nucleic acids also poses challenges in terms of prevention of contamination and false-positive results. Caution should be exercised in drawing conclusions from such datasets without appropriate validation.

In addition to blood spots, other tests, including bacteriologic and virologic tests on various tissues and samples taken, were conducted locally in Kazakhstan at government laboratories and reported elsewhere 3.

Despite the high sensitivity of the methodologies we used, our study is somewhat limited by the sample type (FTA cards), which precludes the detection of pathogens in lymphoid tissues and other organs. The use of FTA cards might also introduce biases in the testing protocols, which can favor or hinder the detection of certain types of viruses. Both metagenomic protocols conclusively identified Pasteurella spp.

Further analysis of P. Our de novo assembly approach also identified short contigs that could not be attributed to any sequence present in several BLAST databases; of those, only 47 were identified using tblastx. Overall, the amount of unexplained sequence seems relatively small, particularly when considering the substantial number of bacterial, viral, and eukaryotic genomes that remain to be discovered or characterized.

Because not all organisms have been sequenced or are available in central sequence repositories, a percentage of reads will always remain unidentifiable. The potential pathogenicity is inherent in the organism and can be triggered opportunistically at any time in response to environmental triggers. The epidemiology and the observed spatiotemporal distribution of ill animals and carcasses in this study suggest that transmission of bacteria from animal to animal did not occur in most cases except from mothers to calves, which occurred through infected milk.

