Skip to content

Commit adac4fb

Browse files
committed
continued vc
1 parent 45f1987 commit adac4fb

5 files changed

Lines changed: 31 additions & 3 deletions

File tree

images/tablet1.png

173 KB
Loading

images/tablet2.png

157 KB
Loading

images/tablet3.png

192 KB
Loading

qc.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ The Raw data were deposited at the European nucleotide archive, under the access
1111

1212
```
1313
wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR957/SRR957824/SRR957824_1.fastq.gz
14-
wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR957/SRR957824/SRR957824_1.fastq.gz
14+
wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR957/SRR957824/SRR957824_2.fastq.gz
1515
```
1616

1717
## FastQC

variant_calling.md

Lines changed: 30 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,13 +12,41 @@ You will need a reference sequence to map your reads to. You can download the re
1212

1313
or from the command-line with
1414

15-
`wget https://github.com/HadrienG/tutorials/tree/master/data/pO157_Sakai.fasta`
15+
`wget https://raw.githubusercontent.com/HadrienG/tutorials/master/data/pO157_Sakai.fasta`
1616

1717
This file contains the sequence of the pO157 plasmid from the Sakai outbreak strain of E. coli O157. In contrast to the strain we are working on, this strain is available as a finished genome, i.e. the whole sequence of both the single chromosome and the large virulence plasmid are known.
1818

1919
### indexing the reference
2020

21+
Before aligning the reads against a reference, it is necessary to build an index of that reference:
22+
2123
```
2224
module load bowtie2
23-
bowtie2-build $ref $prefix
25+
bowtie2-build pO157_Sakai.fasta pO157_Sakai
2426
```
27+
28+
### aligning reads
29+
30+
`bowtie2 -x pO157_Sakai -1 reads_1.fastq -2 reads_2.fastq -S output.sam`
31+
32+
The output of the mapping will be in SAM format. you can find a brief explanation of the SAM format [here](files_formats.md)
33+
34+
### visualising the alignment
35+
36+
To view the outcome of the read mapping, we will use a program called Tablet, that can be run without administrated privileges. Download it [here](https://ics.hutton.ac.uk/tablet/).
37+
38+
Start the program and select Red Button – Open. Choose your SAM-file and pO157_Sakai.fasta as a reference.
39+
40+
![tablet1](images/tablet1.png)
41+
42+
Select the only contig to the left.
43+
44+
![tablet2](images/tablet2.png)
45+
46+
Next, download pO157_Sakai.gff from [here](data/pO157_Sakai.gff). Select import features in Tablet and import pO157_Sakai.gff. This file contains annotations, i.e. what is encoded in each part of the DNA sequence. Two tracks (CDS + GENE) will be added.
47+
48+
Navigate the mapping using the zoom and pan left/right etc. Under Colour schemes you can highlight bases that do not match the reference. Holding your pointer over a CDS will show you a description of the genetic region.
49+
50+
![tablet3](images/tablet3.png)
51+
52+
Does the mapping data confirm the presence of an intact pO157 plasmid in the St. Louis outbreak strain?

0 commit comments

Comments
 (0)