Measures Concepts
GitHub icon

FASTQ

FASTQ - Text data format

< >

FASTQ is a text data format created in 2000.

#369on PLDB 24Years Old

FASTQ format is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores. Both the sequence letter and quality score are each encoded with a single ASCII character for brevity. It was originally developed at the Wellcome Trust Sanger Institute to bundle a FASTA formatted sequence and its quality data, but has recently become the de facto standard for storing the output of high-throughput sequencing instruments such as the Illumina Genome Analyzer.. Read more on Wikipedia...


Example from the web:
@SEQ_ID GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTAAATCCATTTGTTCAACTCACAGTTT + !''*((((***+))%%%++)(%%%%).1***-+*''))**55CCF>>>>>>CCCCCCC65
Example from Wikipedia:
sed -e 'n;n;n;y/!"#$%&'\''()*+,-.\/0123456789:;<=>?@ABCDEFGHIJKL/鈻佲杹鈻佲杹鈻佲杹鈻佲杹鈻傗杺鈻傗杺鈻傗杻鈻冣杻鈻冣杻鈻勨杽鈻勨杽鈻勨枀鈻呪枀鈻呪枀鈻嗏枂鈻嗏枂鈻嗏枃鈻団枃鈻団枃鈻堚枅鈻堚枅鈻堚枅/' myfile.fastq # add -i to save the result to the same input file

View source

- Build the next great programming language Search Add Language Features Creators Resources About Blog Acknowledgements Queries Stats Sponsor Day 605 feedback@pldb.io Logout