Module 3
Data Analysis — Quiz Part 1
← Back to Module
Questions/ Memory Test
Question 1
What is FASTA file format? (Write in short)
Sample Answer: FASTA is a text-based format for representing nucleotide or protein
sequences. It consists of two parts: (1) a header line starting with '>' followed by a unique identifier,
organism name, and gene/marker details; (2) one or more lines of the actual sequence data using standard
nucleotide codes (A, T, G, C). Multiple sequences can be included in a single file, each starting with a
new '>' header.
Question 2
Can we use two primers to sequence more than 1600bp length? Justify your answer.
Sample Answer: No, two primers alone cannot reliably sequence more than 1600 bp. Sanger
sequencing can produce clean sequences with a maximum read length of approximately 800–900 bp per
primer. With two primers (one forward and one reverse), the maximum combined coverage would be around
1600–1800 bp, but only if there is sufficient overlap. For longer sequences, additional internal
primers are required.
Question 3
The FASTA format should be typically in lines —
Question 4
In sanger what type of sample is ideal?
Question 5
What is the average length of full length ITS Primer?