README formatting
This commit is contained in:
parent
97f529ff02
commit
c9285e74e4
40
README.md
40
README.md
@ -2,12 +2,12 @@
|
|||||||
|
|
||||||
## Part A - seq function
|
## Part A - seq function
|
||||||
|
|
||||||
a: Create a vector where the first element is 1, the last element is 33, with an increment of 2 between elements.
|
a) Create a vector where the first element is 1, the last element is 33, with an increment of 2 between elements.
|
||||||
b: Create a vector with 15 equally spaced elements in which the first element is 7 and the last element is 40.
|
b) Create a vector with 15 equally spaced elements in which the first element is 7 and the last element is 40.
|
||||||
c: Use the sample function to create a vector with variable name my.dna that consists of 20 uniformly-random letters “A”, “C”, “G”, and “T”.
|
c) Use the sample function to create a vector with variable name my.dna that consists of 20 uniformly-random letters “A”, “C”, “G”, and “T”.
|
||||||
d: Use the == logic operator and other R functions on your my.dna variable to determine how many of the letters are “A”. Hint: you can use sum on a TRUE/FALSE vector or you can use the functions which and length.
|
d) Use the == logic operator and other R functions on your my.dna variable to determine how many of the letters are “A”. Hint: you can use sum on a TRUE/FALSE vector or you can use the functions which and length.
|
||||||
e: Confirm your answer in d with the table(my.dna). From the output of table, create a pie chart and barplot. Add x and y labels to your barplot.
|
e) Confirm your answer in d with the table(my.dna). From the output of table, create a pie chart and barplot. Add x and y labels to your barplot.
|
||||||
f: Use the sample function with the option prob=c(.1,.4,.4,.1)to create a vector with variable name my.dna2 that consists of 20 non-uniformly random letters “A”, “C”, “G”, and “T”. Use table to show the nucleotide counts.
|
f) Use the sample function with the option prob=c(.1,.4,.4,.1)to create a vector with variable name my.dna2 that consists of 20 non-uniformly random letters “A”, “C”, “G”, and “T”. Use table to show the nucleotide counts.
|
||||||
|
|
||||||
## Part B: NCBI Search
|
## Part B: NCBI Search
|
||||||
|
|
||||||
@ -17,9 +17,9 @@ Search NCBI (http://www.ncbi.nlm.nih.gov/) for “Alzheimer human.” This will
|
|||||||
|
|
||||||
### Evaluation
|
### Evaluation
|
||||||
|
|
||||||
a: What is the name of the gene?
|
a) What is the name of the gene?
|
||||||
b: What chromosome is the gene on?
|
b) What chromosome is the gene on?
|
||||||
c: What species has the most similar gene to the human version?
|
c) What species has the most similar gene to the human version?
|
||||||
|
|
||||||
## Part C: Reading fasta files, nucleotide and dinucleotide frequencies
|
## Part C: Reading fasta files, nucleotide and dinucleotide frequencies
|
||||||
|
|
||||||
@ -31,19 +31,19 @@ Read the fasta file in as a string
|
|||||||
|
|
||||||
### Evaluation
|
### Evaluation
|
||||||
|
|
||||||
a: What data type is the fasta?
|
a) What data type is the fasta?
|
||||||
b: Create a function that converts the fasta string to a vector
|
b) Create a function that converts the fasta string to a vector
|
||||||
c: Using the function from C.2, how long is the sequence?
|
c) Using the function from C.2, how long is the sequence?
|
||||||
d: Show the first 20 nucleotides of the sequence
|
d) Show the first 20 nucleotides of the sequence
|
||||||
e: How many of each nucleotide are there in the sequence?
|
e) How many of each nucleotide are there in the sequence?
|
||||||
f: Create a barplot of the counts, including axes labels
|
f) Create a barplot of the counts, including axes labels
|
||||||
g: Calculate the probability of each nucleotide
|
g) Calculate the probability of each nucleotide
|
||||||
|
|
||||||
## Part D: GC Content
|
## Part D: GC Content
|
||||||
|
|
||||||
a: Add code to your R script to calculate the G+C content of the fasta vector
|
a) Add code to your R script to calculate the G+C content of the fasta vector
|
||||||
b: How many gc pairs are there?
|
b) How many gc pairs are there?
|
||||||
c: Show a barplot of all dinucleotide counts
|
c) Show a barplot of all dinucleotide counts
|
||||||
|
|
||||||
## Part E: Coronavirus
|
## Part E: Coronavirus
|
||||||
|
|
||||||
@ -53,5 +53,5 @@ DNA/RNA: https://www.ncbi.nlm.nih.gov/nuccore/MN908947.3?report=fasta
|
|||||||
Protein: https://www.ncbi.nlm.nih.gov/protein/QHD43415.1?report=fasta
|
Protein: https://www.ncbi.nlm.nih.gov/protein/QHD43415.1?report=fasta
|
||||||
|
|
||||||
### Evaluation
|
### Evaluation
|
||||||
a: Download the DNA/RNA fasta file and determine the nucleotide frequencies. Comment on how the frequencies compare with the human APOE gene.
|
a) Download the DNA/RNA fasta file and determine the nucleotide frequencies. Comment on how the frequencies compare with the human APOE gene.
|
||||||
|
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user