Fastq format

The next exercise will focus on a set of files including fastq files.

The format of one entry is as below:

@Sequence 1
CTGTTAAATACCGACTTGCGTCAGGTGCGTGAACAACTGGGCCGCTTT
+
=<<<=>@@@ACDCBCDAC@BAA@BA@BBCBBDA@BB@>CD@A@B?B@@

The lines represent:
1. Header for fastq entry known as the fastq header. This always begins with a ‘@’.
2. Sequence content of sequence
3. Quality header. Always begins with a ‘+’. Sometimes also contains the same information as fastq header.
4. Quality values for each base in the 2nd line. NOTE: ‘@’ can be used as quality values.

For more information on the fastq format the below resource is good: Wikipedia fastq page