Build a tree from a different bug

This paper on Neisseria gonorrhoeae in South Africa uses a reference mapping approach to build the phylogeny used in Figure 1.

The challenge for this homework is to fetch the reference sequence used in the paper, fetch at least some of the samples in the Bioproject PRJNA575338 and build a phylogeny. Breaking this down:

Questions:

  1. What is the reference sequence used in this project?

  2. How can you download this reference genome in Genbank format?

  3. How many sequences are there in the PRJNA575338 Bioproject?

  4. How can you download these sequence reads?

  5. How can you build a sample sheet for this dataset?

  6. What would the Bactopia command line for analysing this dataset using snippy look like?

  7. Final challenge: can you run Bactopia and get the expected results with this dataset?