Notes for Part II: Information and Heredity CC-BY-NC

Maintainer: admin

BIOL 112 (Winter 2011) notes for Part II: Information and Heredity (Professor Dent's section). From February 16, 2011 to whenever the semester ends.

1Introduction

  • Prof does not have office hours - email him to schedule an appointment, or just email him
  • 3 sets of practice problems - do them
  • A nice story about a central Asian monk named Temujin around 1220 AD and unknown paternity
    • (aka Genghis Khan)
    • Had to give his land to his son, but paternity issues, some sort of controversy
    • Sort of illustrates the concept of heredity, which is what we're learning now
  • The mystery of heredity
    • All organisms come from other organisms (except for the base case, which ostensibly was abiogenesis)
    • All organisms resemble their parents
    • Siblings are not identical
  • The magic that is conception
    • An egg becomes fertilised and undergoes cleavage (cell division) until eventually you get an organism

2Cell cycle

  • All organisms consist of cells, which divide to produce new cells
  • Higher organisms fuse their cells (sperm and ova) to produce a new organism
  • Cell division results in more cells, etc
  • Dividing cells devote a lot of resources to segregating chromosomes
    • i.e. organising them in a line, pulling them apart so that each cell gets an equal number of chromosomes
  • Chromosome: single string of DNA
    • Circular: bacteria
    • Linear: most other organisms (including us)
    • When a cell is ready to divide, chromosomes condense, associate with proteins (e.g. histones)
    • Combination of DNA + proteins = chromatin which is dark and easy to see
    • Normal DNA is difficult to see under a light microscope, but when it's ready to divide, easier to see
    • The DNA double helix wraps around histones, which wraps around other proteins, etc ... all bunched together
  • Karyotype: organising and identifying chromosomes
    • Take a cell that is getting ready to divide
    • Flatten it with glass (mitotic squash?)
    • Stain it with a dye, gives them stripes; take a picture and cut them up etc
    • Shows that chromosomes come in pairs - homologs (except for the sex chromosomes, which don't look like each other)
  • The number of chromosomes is characteristic of a specie (with some exceptions)
    • Humans typically have 46, roundworms have 2, pigeons 80
    • Doesn't really correspond to complexity or anything
  • Just before cell division, each chromosome has been replicated once to produce two chromatids (at the top of the chromosome)
    • Two chromatids for each chromosome in a pair, held together by a centromere
  • Normally a chromosome is a single piece of DNA
    • But when we look at a karyotype, we are seeing two pieces of DNA - the two chromatids, bound by a centromere
    • We call these mitotic chromosomes to show that they are a special case
  • Segregating chromosomes is exacting: a cell needs to give each daughter cell the right number of chromosomes
    • With random segregation, only half the time does each daughter get one chromatid (the right result)
    • So there needs to be some sort of mechanism that ensures that the right result is enforced every time
    • Organisms need at least one of each chromosome, and typically exactly one
      • more can cause problems - ex, Down syndrome - an extra chromosome 21 (which is incidentally the smallest chromosome)
  • So the chromosomes must obviously be duplicated BEFORE the cell divides
  • NOW THE ACTUAL CELL CYCLE (steps of cell division)
    • Chromosome (DNA) replication: S phase
    • Mitosis (M) somatic cells divide into daughter cells, each of which inherits one copy of each chromosome
    • OR Meiosis (M) for germ lines (gametes) - non-identical copies, creating daughter cells that have one of each homolog
    • Cytokinesis: dividing the cytoplasm in two (optional)
    • Starts with Gap 1 (G1) - cell is growing, accumulating resources it needs to divide
    • Then it duplicates its chromosomes - DNA synthesis (S phase)
    • Gap 2 (G2) - another rest phase, preparing to divide
    • Then it undergoes mitosis (M), and the daughter cells go through the cell cycle again
    • To ensure that this occurs in order, there is a checkpoint between each phase
    • From S to G2, cell checks that each chromatid has been duplicated
    • What evidence of this checkpoint do we have?
      • Drug: hydroxyurea, blocks chrosome replication
      • So cell division is put on hiatus indefinitely
      • Another: caffeine, which disables the checkpoint
      • This is usually not a problem, as chromosome replication occurs without problems
      • However, if you both block chromosome replication AND disable the checkpoint, you have problems
      • Because the cell will attempt to continue to the mitosis stage, but will find that it's not able to divide properly
  • Cell cycle tightly regulated to ensure that, in particular, each cell gets the right number of chromosomes
    • Diagram later maybe
  • More on the G1 to S transition:
    • There is a protein called Cdk4, which is always present during the cell cycle but doesn't really do anything
    • But then it associates with another protein called Cyclin D, which is only produced during S phase
    • This complex interacts with other proteins to say that the cell is ready to go into S
    • Cyclin is degraded, Cdk4 is released (no longer functional)
    • So this activity of proteins tells the cell what cycle it's in, when to start transition
    • Although cyclins and cdks are involved in all parts of the cell cycle, important in different transitions, etc
  • How do cells know when to divide?
    • Most somatic cells not dividing - arrested in the G1 phase of the cycle
    • Often waiting for signals from other cells to tell them to divide
    • For example, cells in the immune system: not usually dividing, but when there is an infection, they start dividing
      • A cell called a macrophage identifies a virus, sends a chemical signal to T-cells telling them to start dividing, and about the infection
      • So T-cells don't start dividing until they get a signal from a macrophage
    • Unregulated division of cells → cancer
    • If the G1 to S checkpoint is defective, a cell can divide in an unregulated manner (i.e. dividing when it shouldn't be)
      • For example, if cyclin E is always active or overabundant, a cell will repeatedly divide → cancer
      • If we can understand how signals regulate the cell cycle, we might be able to design drugs to interfere with these signals
      • (And thus fight cancer)

2.1Mitosis

  • Although the process of dividing cells randomly and hoping that each half gets the same stuff works for things like mitochondria and ribosomes etc, doesn't work for chromosomes
  • Cell can regulate chromosome duplication, to ensure proper segregation (unlikely to happen randomly)
    • Mitotic chromosome: the kind we're familiar with, two sister chromatids united by a centromere (only happens during mitosis, usually a messy ball - not yet condensed)
  • Mitotic spindle: mechanism for this
  • Prophase: centrosomes duplicate, migrate to opposite ends of nucleus, chromosomes start to condense
  • Prometaphase: nuclear envelope breaks down, microtubules from spindles (formed from centrosomes, at poles) can interact with chromosomes, bind to each other
    • Microtubules are growing and trying to interact with chromosomes, unstably and randomly sampling space
    • Because centrosomes don't know where chromosomes are, just looking for them with microtubules
    • Each kinetochore is given a geometry so that two kinetochore microtubules from the same spindle can't get both chromatids
    • Basically, kinetochores are the things that the microtubules can attach to, to latch onto the chromatids
  • Metaphase: all the chromosomes are lined up in the middle between the poles (equatorially), attached chromatid to centrosome (metaphase plate)
    • Chromatids must be paired and kept together until it is time to segregate
    • Checkpoint to make sure that all chromosomes are attached to microtubules and on opposite centrosomes, before continuing
  • Anaphase: centrosomes break down, so sister chromatids no longer attached, microtubules pull chromatids apart
  • Telophase: shortly before the cell actually divides, equal number of chromosomes in each half
  • Cytokinesis: only in plants, for dividing the cell
    • Vesicles between the cells are added, fuse to become the cell plate (cell wall)
    • In animals, purse string method - actin and myosin constrict, to pinch and divide the cell
    • Note that cytokinesis does not always happen - some cells have multiple nuclei per cell
    • Example, cardiac tissue is multinucleated (syncytial) - mitosis without cytokinesis
  • Errors: if one chromosome, say, goes to the wrong side, such that one daughter cell has 2 copies of one chromosome and the other has 0, both will die

2.2Meiosis

  • Sex: mixing the genetic material of two organisms, for organisms that are different from you and thus may be better adapted
  • First, need to reduce the number of chromosomes by half
  • Ploidy - the number of sets of chromosomes containing exactly one of each homolog
    • Haploid, diploid, triploid, tetraploid, pentaploid etc
    • Somatic cells are diploid, so gametes must be haploid
  • Meiosis: process by which haploid cells are made, much like mitosis
    • Meiosis I: reduction division
      • Early prophase I, chromosomes condense, move apart
      • Mid-prophase I: chromosomes are condensed, have been duplicated (so 2 of each chromosome, 4 of each chromatid)
      • Late prophase I - prometaphase, little pieces of homologous chromatids start exchanging DNA (chiasma or recombination)
      • Metaphase I: all lined up, except two columns, sort of, in the middle (so pairs of homologues lined up)
      • Checkpoint before anaphase
      • Anaphase I: homologues separate (chromatids stay stuck together), half on each side
      • Telophase I, ^
    • However, the process is not complete - still too many chromatids (diploid)
    • So then we have meiosis II: equatorial division, similar to mitosis
      • Form a spindle, microtubules grab chromatids, so you end up with four haploid cells
      • Prophase II, metaphase II, anaphase II, telophase II etc
  • While mitosis can be quick, cells can arrest in meiosis for a long time
    • E.g. ova, produced by human females; arrest, wait until they're fertilised
    • However, problems with this: Down's syndrome, happens more often with older owmen
      • A problem with meiosis I; two homologues in one cell, so daughter cells (from meiosis II) still have 2 of each chromosome (21 in this case)
      • The older the mother, the longer the egg has been "sitting around", in an arrested stage
      • And the longer the egg has been sitting around the greater the chance of imperfect duplication

2.3Ploidy

  • We're diplontic life organisms - mostly diploid, small haploid portion (fertilisation etc)
  • But there are also haplontic organisms, e.g. algae - almost completely haploid, gametes fuse, undergo meiosis immediately (so briefly diploid I guess)
  • It's not the absolute number of chromosomes that's important, but rather the ratio that matters
  • As long as the ratio is constant (e.g. 3 chromosome 1s, 3 chromosomes 3)
  • However, odd ploidys tend to be sterile (one daughter cell will have more chromosomes than the other etc)
  • Trisomy: genetic anomaly, three copies (instead of two) of a particular chromosome (type of aneuploidy)
  • You can also have triploids (sterile though) and tetraploids or even more
    • For example, store-bought strawberries ... can be octoploids (makes them bigger, still viable and fertile)
    • Another example: frog in South Africa, at one point, evolved to be tetraploid rather than diploid (whole new species); similar but larger than original

3Genetics: Mendel

  • Gregor Mendel: Austrian(/Australian) monk, really cool guy and doesn't afraid of anything
    • Pea plant experiment
    • Tried to explain why for a given character, offspring share traits with their parents, but siblings' traits not necessarily identical
  • Continuous variation: height, skin colour, etc
    • Leads to blended inheritance usu
  • Discrete variation: only a few possible traits for a given character
    • Example: pea flower colour or mouse fur colour
    • What Mendel focused on (pea plant colours)
  • Used true-breeding strains (e.g. round pea, only gives round peas when fertilising itself)
  • True-breeding round + true-breeding wrinkled = only round in F1
    • But F2 progeny, 1/4 wrinkled, omg genetics
    • Same sort of thing happens with many other traits that exhibit complete dominance
  • Vocab terms: genes, alleles (different possibilitiesfor a gene), hetero/homozygous
  • Punnett squares etc
  • Stochastic process, follows laws of probability, pretty straightforward
  • Genotype: set of alleles; phenotype: set of traits
    • Genotype uniquely determines phenotype, but phenotypes can be the result of many different genotypes
  • Alleles of the same gene segregate independently; same for alleles of different genes (law of independent assortment)
    • Example: SSYY x ssyy
    • F1: All SsYy
    • F2: 9:3:3:1 (S-Y-:ssY-:S-yy:ssyy)
  • Note on Punnett squares: if you have n genes, $2^n$ genotypes
    • Some gametes may have the same genotype; still have to include them
    • If you have more than 3 genes, just use probabiddy, not Punnett squares
  • Conclusion: heredity inherited through discrete units (alleles)
  • Mendel's discoveries were ignored until the discovery of chromosomes (discrete structures that could be responsible for this phenomenon)
    • If genes are on different chromosomes, then they can segregate independently during meiosis (arrangement across metaphase plate)
    • However, they don't HAVE to be on different chromosomes - chiasma (genetic recombination) during meiosis (?)
    • But the law apparently only applies to genes on different chromosomes
  • Usually can't see alleles, but in the case of sex chromosomes, you sort of can (XX vs. XY)
    • Y is kind of dominant - XXY is male, X is female
  • Red-green quarter blindness: sex-linked (X), recessive
    • Colour-blind females have affected fathers
    • Colour-blind males usually have unaffected parents (colour-blind dad = red herring; colour-blind mother = unlikely)
    • There is no colour-blind gene on the Y chromosome - too small I guess
  • Wild-type (predominant, >99%) vs. mutant alleles (sometimes purposely-induced mutations)
    • Polymorphic alleles - > 1% of population
  • Autosomes (non-sex chromosomes) and allosomes (sex chromosomes)
  • For genes on the same chromosome:
    • Experiment with fruit flies, either two genes are on the same chromosome or not
    • Results: the genes are on the same
    • Draw the Punnett squares for both possibilities, results matched the same-chromosome situation
    • Also tells you how the genes are arranged on the chromosomes (e.g. both dominants on the same chromosome)
    • HOWEVER, you also get some evidence that the genes are NOT linked to the same chromosome
    • Of course this is due to genetic recombination (crossover etc)
      • Recombination occurs at random points on the chromosome
      • So the probability or rate of recombination occurring between two genes depends on their distance from each other
      • The further apart two genes are, the more likely that they will be separated
      • So measuring recombination rate can tell you about the distance between two genes on a chromosome
      • This distance is measured in centiMorgans (cM) = number of recombinations / number of total progeny
      • Example: if black is 16.6 cM from vestigial and vestigial is 12 cM from curly ...
      • Then you don't know how far black is from curly, could be either 28.6 or 4.6

3.1Continuous variation

  • Results from having many genes determine each trait, and many alleles for each gene
  • Example, snapdragons and colour (red + white = pink, intermediate phenotype), incomplete dominance
    • Reason: white does not have gene for producing red pigment
    • Pink is heterozygous, so produces only half the amount of red pigment, so pink
    • But if you had many alleles and many genes you can get a true range of phenotypes
  • Although not exactly continuous variation, it does illustrate intermediate phenotypes (although there is only one)
  • Example in humans: allelic series and Alzheimer's disease
    • Different allelic combinations result in different risks of developing the disease
    • So the alleles are working additively
  • You can also have multiple genes affecting a phenotypic trait (multigenic or polygenic)
    • Example: heart disease, many genes working additively
    • Each allele acts as a single Mendelian trait, but their sum gives the actual result
  • Epistasis: interactions between alelles of different genes
    • Example: fur-colour in mice, not a single locus - two genes determine, discrete trait
    • If you have two recessive alleles for the albino locus, you are albino
    • But if you don't have two recessive alleles for the albino locus, you produce pigment
    • But the pigment colour is determined by another gene, Agouti - either brown if you have bb or black if you have B-
    • So mice can only be either brown or black or white
    • In other words, the genes assort independently, but the ratio is not 9:3:3:1 because of interactions (epistatis)
    • Instead, 9:4:3
    • Epistasis often results from genes involved in different steps along the same pathway or process
  • Environmental contribution to phenotype:
    • Penetrance: percentage of individuals with a given genotype showing a certain phenotype
    • Expressivity: the degree to which a phenotype is expressed

4DNA and heredity

  • Chromosomes: physical things that carry heredity, and they are made of DNA OMG
  • How to get the stuff that chromosomes are made from: purify them, assay the components
    • Assay: just a way of measuring something
    • For example, an allergy skin test assay, to find out what substance is causing allergic reactions
    • Separate the components, test them one at a time
    • For DNA: grind up any living organism
      • Use an organic solvent to extract lipids and proteins (e.g. phenol)
      • Precipitate with ethanol, and you get just DNA
    • Now do the assay test
      • Hypothetically, get some porcupine DNA, dip a cat into the DNA, should result in a pet porcupine RIGHT?
      • No
    • But you can actually do something similar, with bacteria (discovered by Frederick Griffith)
    • Can an extract from dead bacterial cells genetically transform living bacterial cells?
    • S strains and R strains of bacteria, S is deadly, R is harmless (when injected into a mouse)
      • So take the S strain, heat it up so that the bacteria are dead, inject in mouse - harmless
      • However, if you kill the S strain, mix it with the R strain, and inject it into the mouse, it will die
      • Essentially you have transformed the R strain into the deadly S strain, by mixing them (something transferred)
      • Transforming principle - hereditary
    • Oswald Avery, simple experiment to identify the transforming principle - turned out to be DNA (not lipids carbs etc)
    • Another experiment on this done on viruses, which can also have DNA
      • Specifically, bacteriophages which infect bacteria, inject something into the bacteria
      • Which causes more bacteriophage to be made within the bacterium
      • Bacteriophage have only DNA and proteins, so the Hershey-Chase experiment set out to show that it was DNA that was being injected
      • Done by labelling protein with radioactive sulfur and DNA with radioactive phosphate (has a phosphate backbone)
      • Then blend the mixture of phage and bacteria, so that the phage gets stripped off the top of bacteria
      • Locate the DNA after centrifuging - is it in the bacteria (pellet I think)? Or is it in the supernatant (liquid at top)
      • Shows you clearly that DNA is what is being injected

4.1Structure of DNA

  • Nucleoitodes: base (4 different kinds) + sugar (deoxyribose) + phosphate, polymer backbone (ribose-phosphate-ribose)
  • 5' phosphate at one end, 3' hydroxyl at the other end (so directionality)
  • Chargaff's rule: ratio of bases (A:T:G:C) specific to organisms (humans have a different ratio than corn etc)
    • Chargaff's analysis also showed that the ratio of A:T is 1:1, same for C:G (so those are always paired)
  • X-ray crystallography (diffraction etc, cool stuff)
    • Showed that DNA consists of two strands, double helix, antiparallel (so each end has a 5' and a 3' end)
    • Phosphates probably on the outside
    • Watson and Crick put it all together for a good model of DNA
    • Minor groove: when backbones are close together; major groove: when they're far apart
    • DNA strands are the reverse complement to each other (opposite directions, and A-T, C-G)
    • The reason A-T and C-G are paired: natural affinity of bases
      • A-T, form hydrogen bonds when they lie together in a plane with the backbone on the outside
      • C-G, form three hydrogen bonds (?) so these hydrogen bonds determines base pairing
      • Note: A, G are purines (large); T-C, pyrimidines (small bases), so no bulges; distance between two strands always the same
    • RNA, another nucleotide, can also form complementary strands, with U instead of T
  • Key to DNA: sequence of nucleotides not constrained by the structure; can accommodate any sequence
    • Each strand has the same information as the other strand - so in a sense it's already replicated
    • But how does it actually replicate (to make children etc)

4.2DNA replication

  • Original possibilities considered:
    • Semi-conservative, two strands separate, new strand for each
    • Conservative, two strands separate, make copies, those copied strands join
  • Experiment to determine what actually occurs (Meselson and Stahl):
    • To distinguish between new and old DNA - N14, light DNA; N15, heavy DNA
    • Use a centrifuge to separate them
    • Procedure: grew bacteria in heavy DNA, allowed it to replicate, became lighter etc
    • After one generation, all the DNA migrated to the middle, suggesting that all the DNA is intermediately heavy
    • Also, adding a new light generation, etc
    • So this supports the semi-conservative model - each pair of strands is half new, half old
    • If replication were conservative, this experiment after two rounds would result in 1/4 heavy, 3/4 light
      • Reason: the original would remain heavy, would make a new light strand after one replication
      • After the second round, the original would still be heavy, but the new ones would all be light
  • Making DNA in a test tube:
    • Triphosphate nucleotides
    • DNA polymerase (an enzyme)
    • DNA template (an old strand that can be duplicate), which must have ragged ends
    • Ragged ends: one of the strands sticks out, which acts as a template to stick a new base on
    • For example, if you have a G at a ragged end, you would add a C to it (formed from the materials)
    • This ragged end must be at the 3' end (so that you attach shit to the 3' base)
    • Indicates that the direction of polymerisation is 5' to 3' (so start make the 5' end first)
    • When in the lab, you create ragged ends by breaking up the DNA with phenol etc
    • But how does the cell make ragged ends (which it hates) and how does it pull them apart?
      • Solution: enzyme called helicase, unwinds the DNA (pries them apart, like unzipping)
      • Enzyme called primase, creates fake ragged ends - creates a short RNA primer
      • Then the polymerase uses the primer as a 3' end
      • Leading strand (can follow the helicase) vs. lagging strand (has Okazaki fragments)
      • How exactly does this work? Ascertain this
    • This bubble formation is not random, occurs at specific locations in DNA
    • For example, bacteria, circular chromosomes; creates two linked rings (which are then broken apart)
    • For our linear chromosomes, many different origins of replication, eventually the whole thing is replicated
  • Error correction mechanisms:
    • DNA proofreading by DNA polymerase III
      • Sometimes it can see when it made mistakes, go back, replace it with the right base
    • Mismatch repair
      • If the new strand does not match the old, an enzyme will fix it
  • Excision repair: bases are damaged, will replace them
    • Example: UV light produces thymine dimers, where thymine bases are bonded to each other
    • An enzyme can recognise this, will rip out the bases and replace them with good ones
    • Can repair single bases, multiple bases, etc

5The genetic code

  • Key to DNA:
    • DNA can accommodate any arbitrary sequence, which can encode information
    • Two strands encode the same information in complementary format - method for replication
  • Alkaptonurea: metabolic disease, urine turns black when exposed to air
    • Garrod figured out that is a recessive hereditary trait (ran in the family etc)
    • Then deduced that it results from the absence of a specific enzyme
    • So the enzyme would normally convert a certain compound into another, but as that enzyme is missing ...
    • The compound remains in that format, and when oxidised, that compound turns black
    • Thus genes correspond to enzymes, or something
    • Another example of a gene in a pathway (first example: epistasis)
  • Experiment: Beadle and Tatum
    • Show that genes determine enzymes in a biochemical pathway
    • Established: one enzyme, one gene, for a series of genes in a pathway
      • We now know: one gene, one protein (so each gene encodes for a protein)
    • In this pathway, we have compound conversion: ornithine --> citrulline --> arginine (an essential amino acid)
    • So this pathway must be intact for organisms to grow etc
    • Method: put spores of each arg (mold) mutant strain in a medium with and without nutritional supplements
      • Normally if you add the minimal number of nutrients, the wild-type can grow
      • But sometimes there are mutants that need additional supplements to grow
    • The wild-type: you can add ornithine, or citrulline, or arginine, and it will grow
    • One mutant type - will only grow if you add citrulline or arginine
    • Another mutant type - only on arginine, clearly missing an enzyme to convert shit into citrulline
    • So it was possible to identify a number of mutants, each one corresponding to a missing enzyme in the pathway
  • RNA: ribonucleic acid, how information goes from the nucleus into the cytoplasm (where proteins are actually made)
    • So DNA is converted into RNA, which carries information into cytoplasm
    • Ribosomes use this information to make proteins
    • So DNA information flows to DNA (replication), and also to RNA, then to proteins
    • Difference: uracil instead of thymine; has both hydroxyl groups on ribose (?)
    • So you can have these double-stranded hybrids of RNA and DNA
  • Transcription: enzyme = RNA polymerase
    • Enzyme unwinds the two strands of DNA
    • Starts making an RNA strand (5' to 3') complementary to ONE of the strands
    • Only goes a short distance - corresponding to how much information it needs
    • Then the RNA goes out of the nucleus, ribosomes grab onto it to make proteins
  • Fred Sanger - sequenced the first protein (insulin)
    • Showed that a sequence of amino acids is characteristic of a protein
  • 64 possible sequences, only 20 amino acids, so yeah some redundancy
  • Experiment to figure out which sequences encode for which amino acids:
    • RNA with only one of: UUU, AAA, or CCC (repeated)
    • Conclusion: each triplet is mRNA for a different amino acid
  • Gobind Khorana: AAGAAGAAG
    • Some proteins were made of LysLysLys (amino acids)
    • Others, ArgArgArg ... others, GluGluGlu
    • Due to the three possible reading frames for any sequence of DNA
    • If you do enough experiments like that, you can figure out which amino acid each codon corresponds to
    • Methionine: AUG/ATG, start codon
    • Stop codon: TAG/TAA/TGA (or with U)
    • Code is degenerate: while DNA uniquely determines the amino acid sequence, the reverse is not true
  • How does the ribosome know how to interpret the RNA sequence?
    • Unlike in DNA, where one strand has natural hydrogen bond affinity to the other strand, this is not true for ribosomes
    • No affinity of amino acids to specific codons in RNA
    • So there is another type of RNA - transfer RNA, has a 3' and 5' end, and has an anticodon
      • The reverse complement of the codon that encodes for the amino acid
      • There is 1+ tRNA for each amino acid
      • So for CAT the tRNA would be GTA I guess
      • Each tRNA has a unique shape
      • So the tRNA binds to the mRNA, and the correct amino acid is on the tRNA (other side)
      • Which is how you get the amino acid, the amino acids join up etc
    • How does the cell know which amino acid to attach to a specific tRNA?
      • Answer lies in the structure of the tRNA itself - shape, also tRNA synthase
      • Enzymes recognise specific tRNA and amino acids, attach them to each other
      • If you, say, converted an amino acid on tRNA to something else, the ribosome wouldn't know, would just blindly insert it
    • This code, shared by organisms ... evidence for a single common ancestor, presumably around 6000 years ago

5.1Transcription and translation

  • Clicker question: if the enzymes for converting a precursor to ornithine and ornithine to citrulline were missing ...
    • What would the neurospora grow on?
    • Answer: arginine or citrulline. Since citrulline can still be converted to arginine.
  • One DNA strand has the same sequence as the mRNA (sense/coding strand)
    • This is the strand we look at if we want to see what amino acids are being produced, etc (just change T to U)
    • The other one acts as the template - the antisense strand (reverse of mRNA)
    • So the DNA sequence that is actually transcribed is the reverse complement of the mRNA
  • Some genes are transcribed in the other direction - from the other strand of DNA
    • It's more or less random which strand of DNA actually gets transcribed
  • How does the RNA polymerase know where to start?
    • Binds to a specific DNA sequence upstream of the place to transcribe, called a promotoer
    • So just in front of the portion of the sequence that needs to be transcribed
    • So the promoter recognises this stretch of DNA by its sequence (e.g. TATA box lol)
    • All genes have some kind of promoter - not necessarily the same promoter of course
    • So these promoters are encoding information, but not for amino acids
  • The three-frame problem: how does the cell know which reading frame to choose?
    • Experiment: confirmed that coding regions always begin with methionine (ATG)
    • The first methionine codon that the ribosome can find signals the start of the reading frame
    • Ribosome consists of two subunits, one large one small
    • Small one recruits a methionine tRNA
    • Large subunit will sit on the p-site
    • Will start going along, grabbing tRNA as it goes along
    • Will replace each tRNA by another one as needed
    • N/amino-terminus: start of sequence
    • Ends at a stop codon (release factor - peptide that causes the compound to be released when it encounters a stop codon)
  • Wobble pairing:
    • Although there's a tRNA for every amino acid, there isn't tRNA for every codon
    • During the interaction of the anti-codon with the codon
    • For example: CAU should have AUG as its anticodon (5' to 3') but instead, it's GUG
    • Rules determine which wobbles work, depending on the anticodon position (5' end or 3' end)
    • tRNA also has a different kind of base called Inosine (I), can pair with A/U/C in the anticodon position
    • Hence, the degeneracy of the genetic code, and why you don't need one tRNA for every codon
    • (Because some tRNAs can bind with more than one type of codon etc)
    • Know how to determine possible anticodons made on codon sequences for amino acids and the Wobble pairing rules
  • Transcription and translation can be simulatneous
    • In prokaryotes, ribosomes can start translating mRNA before transcription is complete
      • Not possible in eukaryotes, because ribosomes can't get into the nucleus, where transcription is occurring
    • Proteins can begin to function before translation is finished (before they've been completely synthesised)
  • Polysomes - many ribosomes for a single strand of mRNA
    • Like a train of ribosomes, all producing the same peptide simultaneously

5.2Mutations

  • Demonstration of link between mutation and phenotype - sickle-cell anemia (autosomnal recessive trait)
    • Unusually high prevalence among Africans (heterozygous for it) - link to malaria lol
    • Main protein in blood: hemoglobin, binds to oxygen and carries it around the body
    • But in sickle-cell anemia, hemoglobin will form fibres, under certain conditions, instead of being soluble in the blood
    • Cause red blood cells to stretch, break and eventually die
    • Cause: change in the hemoglobin amino acid sequence
    • Experiment done by Linus Pauling - cut up hemoglobin into small pieces, using enzymes
      • 2-D gel to separate the peptide fragments based on their charge, in one direction (different amino acids have different charges)
      • And in another direction based on their size
      • So where peptides end up depends on their amino acid sequence
      • And it turns out there is one peptide in a different location for sickle-cell patients
      • Which is likely due to a different amino acid sequence
      • Later, it was sequenced, confirmed - sickle-cell anemia caused by a single base pair change
      • Results in a single different amino acid - valine instead of glutamic acid - which is enough to cause sickle-cell anemia
  • Classifying mutations:
    • Point mutations - small change in DNA (e.g. one base pair changed), usually affect one gene
      • Missense mutation: single nucleotide change --> different amino acid produced, so the protein is different (e.g. sickle cell)
      • Nonsense mutation: when something becomes a STOP codon, protein truncated early
      • Deletion (frame shift): individual nucleotides deleted, changes both the amino acid and causes a frame shift ... subsequent amino acids different
      • Insertion (frame shift): same as above except adding in a random nucleotide
      • Deletion (no frame shift): if you delete 3, you get rid of one amino acid, but there is no frame shift (skipping an amino acid)
      • Insertion (no frame shift): same as above but adding 3 (or a multiple thereof)
      • Silent mutation: no effect, due to degeneracy in the code (for example, TAG because TGA)
    • Chromosomal mutations - large changes in chromosomes, usually affect many genes
      • Deletion; delete a large chunk of a chromosome, removing many genes in the process (ex: bands missing in chromosome)
      • Duplication and deletion: unequal crossing-over (during recombination) - one chromosome has extra genes, the other, missing
      • Inversion: a piece of the chromosome is oriented backwards, often has no detectable effect if the break is between genes
      • Reciprocal translocation: recombination occurring between non-homologues (which shouldn't be recombining)
  • Mutagen: agents that cause changes in the DNA sequence
    • chemicals, radiation, viruses etc
    • Of course, even replicating DNA itself can cause mutations
  • Mutations can occur in two different cell populations - somatic cells and germ line cells
    • If a somatic cell gets mutated, then further somatic cells may be mutated, but can never escape the body
      • Can kill a cell, or make it cancerous, or harm it or do nothing or whatever
      • Transmitted to daughter cells, but never to progeny
    • If a germ line cell gets mutated, then progeny may be mutated too ... that allele can be a polymorphism in the gene pool

6Molecular genetics of prokaryotes

6.1Viruses

  • Viruses first discovered by Dmitri Ivanovsky - Tobacco mosaic virus
    • Many tobacco plants dying of a disease
    • The only infectious agent known at the time = bacteria
    • But when he ground up the leaves, looked under a microscope, couldn't see them
    • And it had to be an infectious agent, not a toxic agent - even when diluted and transferred many times, still infectious
    • If it were bacterial, using a Chamberlain filter would get rid of the infections
    • (Filter whose holes were large enough for water to pass through but too small for bacteria)
    • However, when these were used, the infection was still transmitted ... so the infectious agent must be smaller than bacteria
    • So viruses were defined base on their disease-causing abilities and size
    • Initially, viruses defined based on their host specificity
    • Then later, based on their genetic material (some have only RNA, no DNA)
    • Several forms of DNA:
      • Single-stranded DNA
      • Double-stranded DNA (e.g. chicken pox virus)
      • Single-stranded RNA (e.g. tobacco mosaic, influenza)
      • Double-stranded RNA
      • The above can all be linear, like us, or circular, like bacteria
  • Bacteriophage - viruses that infect prokaryotes
    • Infects a bacterium, injects its DNA into it, uses the bacterium's ribosomes to replicate
    • Eventually you've digested all the bacterial genes, and lots of phage proteins etc
    • With T4, there is a lytic cycle - burst out of the bacterium, infect many others
    • Lysogenic phage - phage injects chromosome into the bacterium's DNA (now called prophage, continuous with bacterial chromosome)
      • Might be done if conditions are not good - don't want to infect others just yet
      • And the bacterium does not know that its DNA has phage DNA, so it keeps replicating
      • In the process, the virus is replicated in all the host's offspring lol
    • Some phage have both cycles, others are only one or the other
    • When lysing, it's important to assemble all your phage parts before you lyse
      • Done using promoters
      • Elements in front of genes, indicate when they should be transcribed
      • Early genes need to be transcribed first, then they can turn on the "late" genes, which are responsible for lysis
    • Phage are capable of exchanging DNA
      • In a high multiplicity infection (when you can have more than phage affecting a bacterium at the same time)
      • Phage have only one single chromosome
      • If they have different alleles, they can recombine, like in prophase I of meiosis ... cis to trans and vice versa
      • Can influence what the plaques look like (dark/light/small/large etc)
  • Eukaryotic viruses usually have an additional layer of complexity (often)
    • Glycoprotein envelope on outside (coat)
    • Lipid bilayer membrane, with the proteins above embedded in them (sometimes, not always)
    • Nucleocapsid
    • Viral RNA/DNA genome
    • Binds to surface of the cell due to proteins on their membrane binding with proteins on surface of cell
    • Responsible for tropisms of viruses - the certain types of cells that they like to infect
    • Allows them to get inside the cell through a vesicle; the two membranes fuse, virus released into cytoplasm
    • Virus genetic material is transcribed; proteins made
    • These proteins go through the Golgi, then bud off, forming new virus particles (using the cell machinery - co-opting)
  • Eukaryotic viruses can exchange genetic material
    • H1N1 - mixture of various influenza strains
  • HIV retrovirus - have RNA as genetic material when they enter the cell
    • Also carries a protein called reverse transcriptase
    • Known to bind to proteins (CD4, used for finding infectious agents) on the surface of immune system cells
    • The RNA enters the cell, then makes a DNA copy of its RNA genetic material
    • Then inserts DNA copy of genome into the genome of the host, damn
    • And once the cell divides, the host will make RNA copies of the retrovirus' DNA, which is just the original RNA
    • So it will make new viruses just by replication ... the virus is now part of the host cell's genome
  • Chicken pox - caused by a double-stranded DNA virus (varicella)
    • Once you get over chicken pox, the virus travels through nerve cells, infects the spinal cord, lies there dormant
    • So you can get recurring infections - e.g. shingles (from zoster), virus becomes active again
    • Member of the herpes virus family - characterised by reemergent infections, difficult to get rid of

6.2Bacteria

  • Bacterial replication - usually happens asexually (binary fission)
    • Experiment with phenotypes - ability to grow on different types of media
    • One strain of E. coli, needs methionine and biotin for growth
    • Another strain, threonine and leucine required for growth
    • When you combine them, some bacteria don't need either ... due to genetic material being exchanged
    • Conjugation: sex pilus, tube formed between two bacteria, one feeds plasmids into the other, which has none
    • F-plasmid: necessary for forming conjugations; an F+ bacterium can form a conjugation tube
    • Then the bacteria that receives the plasmids becomes F+, can also form conjugation tubes
    • This process also allows chromosomal DNA to be moved
    • The plasmid gets inserted into the chromosome - Hfr (high frequency of recombination) strains
    • Bacteria have circular DNA, so they just insert themselves somewhere in there
    • Or, they can just recombine partly (like in prophase of meiosis I), the non-used part degrades; the rest divides
    • Experiment: allow bacteria to start to conjugate, then interrupt the process
      • Frequency of a gene being transferred through a tube depends on how close it is to the origin of replication
      • Allowing you to map the genes
  • Transduction: "hitchhiking on phage"
    • Phage accidentally carrying around a piece of bacterial chromosome
    • They can't replicate, but they can insert it into other bacteria, spreading those genes around, lol
  • Transformation: moving DNA around on plasmids
    • Some bacteria, enzymes increase in level to take advantage of increased lactose in the environment
    • In E. Coli - three genes responsible for metabolising lactose (Beta-galactosidases), have a promoter
    • The three genes form an operon - transcribed onto same mRNA; ribosomes make them into separate proteins
    • Lac repressor - between the promoter and genes; lac operator, prevents polymerase from binding and making RNA from these genes
    • Basically represses these genes during specific conditions (repressor molecule, binds to DNA)
    • But when lactose gets into the cell, lactose binds to repressor molecule ... allosteric regulation
    • The repressor molecule no longer binds to DNA, so the gene gets turned on basically
    • The reverse is also possible - the regulation of tryptophan formation
      • If tryptophan is present, tryptophan binds to repressor, the repressor blocks the polymerase from making more enzymes
      • So, a negative feedback mechanism to create an appropriate amount of tryptophan (homeostatis)
    • What if you replaced the lac operon with the trp operon?
      • Cells can build sort of complex logic circuits using these repressors
      • Sophisticated regulation based on what's present in the cell at the time
    • There are also activator proteins - cyclic AMP receptor protein
      • Cyclic AMP is usually present in high quantities in the cell in the absence of glucose
      • When it binds with protein kinase A, encourages polymerase to come and transcribe this gene
      • Bacteria weighs the relative advantage of glucose vs. lactose

7Molecular genetics of eukaryotes

  • Back to ploidy - greater ploidy, greater size
    • One possibility: just has more cells (not true)
    • Other possibility: just has bigger cells (this is true, generally)
    • Cell size roughly correlated to genome size
    • So, possible to find impressions in bones of dinosaur fossils, which give the size of cells
    • From that, you can get the size of the dinosaur DNA
    • Genome size usually given in number of base pairs - the haploid number (so number of base pairs PER chromosome)
      • So humans have 6 billion bp per cell but 3 billion base pairs per chromosome
    • Although you'd expect humans to have many genes based on numbers in other organisms (e.g. roundworms)
    • Humans actually only have about 30,000 genes
    • Turns out humans have many more genes than they actually need
    • Large stretches of non-coding DNA between genes that actually code for things
    • Exons: things that code for proteins; introns: don't encode for proteins
    • When the RNA gets transcribed from the DNA, all of it gets transcribed into a primary RNA transcript
    • Then the introns get cut out, and the exons get spliced together, so you have a continuous RNA strand, all of it coding for amino acids
    • Experiment that showed this: heat up a sequence with an intron between exons
      • Hybridisation: forms loops, so the exons join together
      • Through heating up and cooling down ?
    • There are enzymes that do this
    • Small nuclear ribonuclear protein - enzyme that consists of protein and RNA (RNA strand, essential component of enzymatic activity)
      • Recognises specific sequences at a 5' splice site
      • Then loops an intron out, sort of ... splicing
      • Example of RNA processing (doing things to the RNA before it's translated by ribosomes)
      • Others: getting a 5' G gap, and polyadenylation (addition of a poly A tail)
      • G-cap: 5' end, guanine binds to the other 5' end, poly A's added to the end
      • Tells the cell that this is mRNA that needs to be translated
  • Purpose of junk DNA?
    • Telomeres: protects the ends of DNA, because there's an inherent flaw in the way DNA polymerisation occurs
    • When ready to replicate DNA, separate the strands, add an RNA primer (3' hydroxyl to start the process) - helicase + primase
    • So DNA polymerase III can then come along, use the primer to start transcribing
    • But then DNA polyermase I rips out the RNA, and there's a gap at the end, since it's a linear chromosome; that end not filled in
    • If you do this once, you get recessed ends on both ends of the chromosome
    • Every time you replicate the DNA, it gets shorter ... until eventually you have no chromosomes
    • Cell deals with this by creating telomeres. Short stretches of extra DNA, that are useless, on the ends of chromosomes
    • Acts as a buffer so that when it's lost, it doesn't matter
    • Telomerase comes along, adds a lot of random DNA to the end of a sequence
    • Telomerase is very important:
      • Most human cells don't express telomerase; as DNA replicates, chromosomes keep getting shorter
      • Until they get too short, triggers a checkpoint, cells stop dividing
      • Part of what happens in human aging - your telomeres start getting shorter, eventually you run out of telomeres
      • Also, cancer cells often express telomerase, so they can divide indefinitely
    • Clicker question: 100 six-base-pair telomeric repeats on the end of chromosomes
      • And primers are 25 base pairs long
      • You get maximum 24 divisions before you lose all your telomeres
    • Some non-coding DNA actually has a function
      • Centromeric regions tend to have highly repetitive DNA, which is how cells know where to find a centromere
    • However, the vast majority of DNA that doesn't code for cellular proteins = transposable elements
      • Pieces of DNA that don't really have a purpose
      • Transposons: encode proteins called transposase in their middle
      • Retrotransposons: look like retroviruses, only they aren't infectious
      • Non-LTR retrotransposon: have reverse transcriptase
      • Alu sequence: something
      • Transposons: have inverted repeats on either end, they can excise themselves, insert themselves somewhere else
      • Sometimes they land in a cellular gene, disrupting the gene and causing a mutation
    • Reason there are so many transposons:
      • When one moves somewhere else, it leaves a gap
      • Cell fills in the gap with whatever is on the homologous chromosome
      • So you get another transposon where it used to be
    • Retrotransposons have an RNA intermediate, require reverse transcriptase
      • Reverse transcriptase gene gets made into a protein by the cell
      • Makes DNA from the RNA, that DNA goes back into the genome, inserts itself somewhere
    • Non-autonomous non-LTR: get transcribed, piggyback on reverse transcriptase made by other genes in the genome, same thing
  • Pseudogenes: things that look like cellular genes, but without the introns
    • So when reverse transcriptase acts on mRNA, making DNA that then gets inserted back into the genome
    • But, doesn't have a promoter or anything, so they can't actually do anything
  • Conclusion: genome is a complex ecosystem, transposable elements compete for survival
  • Which came first? polymerase gene (the DNA) or the polymerase enzyme (the protein)?
    • Chicken and egg problem lol
    • Answer came from the study of self-splicing introns
    • Experiment: put RNA in test tube without any enzymes, as a control
    • Supposed to be a negative control ... but the RNA still got spliced
    • Turns out the RNA can fold into a structure that has catalytic properties, which then splices itself out
    • Ribozyme - RNA enzyme; can catalyse a variety of reactions just like proteins
    • Can hybridise to itself
    • Example: ribosomes are actually ribozymes, just RNA with a few unimportant proteins associated with it
    • So the first enzyme could have had just RNA, and this RNA resulted in ribozymes
    • Maybe there was RNA polymerase that could replicate itself
    • So primitive organisms could have been made by RNA that also acted as a lipid membrane
  • Housekeeping genes ..?
    • Cells that do not have a particular protein usually don't have the mRNA for it
    • So regulating the transcription of a particular gene - main method of controlling which genes are present
    • In both eukaryotes and prokaryotes, RNA polymerase can regulate transcription due to promoter element (TATA box)
    • Which is usually about 25 bp upstream of the actual gene
    • Eukaryotes have three polymerases
      • I transcribes rRNA
      • II: mRNA (most important)
      • III: tRNA and other small RNAs
    • Polymerase doesn't just bind to the DNA and transcribe; proteins of the TFII family prepare the way
    • TFIID - complex of proteins including TBP, binds to the TATA box
    • TFIID recruits other TFIIs that eventually recruit the polymerase II
    • Enhancers and silencers - unique to eukaryotes
      • Can act from a great distance away
      • Can be inverted and will still work
      • Transcription factors bind to specific sequences; protein domains probe major groove of DNA
      • These factors determine what proteins are synthesised in different tissues
      • Example: heat shock genes
      • Genes can mix and match silencers and enhancers to get a unique pattern of proteins expressed
    • How histones affect DNA transcription:
      • Heterochromatin, stains darkly; not transcribed in contrast to the lightly-stained euchromatin
      • For example, in females, one X chromosome is inactivated by condensing it, so it becomes unreadable
      • This happens early during embryogenesis; one chromosome chosen at random for inactivation in the cell
      • Then every daughter cell will also have the same chromosome inactivated (in the form of a Barr body)
      • Different cells will have different X-chromosomes expressed, so different parts of a female can have different phenotypes
      • Example, Calico cat - one yellow and one black allele
      • Some parts of the body, yellow allele activated; others, the black allele activated, resulting in patches
    • Rate of transcription can also be controlled by having multiple copies of the same gene
      • Example: humans have like 280 rRNA genes, because they need them (to make ribosomes)
      • However, frogs have millions, because they need to be able to make tadpoles quickly
      • Called gene amplification - some cancer cells do this to become resistant to anti-cancer drugs
    • Alternative splicing
      • Sometimes, when introns are spliced out, so are exons by mistake
      • Results in proteins that are missing chunks
      • For example, you can mutate the doublesex gene in Drosophila to create transgendered flies
      • So you suppress neither femaleness nor maleness
    • RNA stability - must be unstable otherwise you only need to make RNA once and it would be around forever lol
      • Example: excess tubulin binds to tubulin RNA, decreasing stability ... negative feedback loop
    • Control of translation
      • Changing the amount of capping
      • Factors binding to the RNA to prevent ribosome attachment
      • RNA interference: cell makes small RNA, does not encode protein, but complementary to mRNA that does
      • This RNA (siRNA) then binds to that mRNA with the help of the protein Dicer, blocking its translation
    • Post-translational controls:
      • Kinases: enzymes that phosphorylate other proteins (important way proteins control each other's activity)
      • Selective protein degradation - proteins that there are many of are marked for degradation
    • Trade-off between efficiency and speed
      • Regulation is efficient because unnecessary proteins are not made
      • So no energy is wasted, but it takes longer or something
      • Post-translational regulation is faster but not very efficient, you make proteins that do nothing until they're activated

8Recombinant DNA technology

8.1Cloning

  • Genetic engineering: taking a single gene from one organism, putting it in another, perhaps altered
  • Diabetes type I:
    • Normally, when blood glucose is high, pancreas releases insulin, causes glucose to be absorbed
    • When low, pancreas releases glucagon, liver releases glucose into bloodstream for other cells to use
    • In diabetes, insulin not produced; you die, bathed in glucose
    • 1921, Banting and Best purified insulin from dogs, could treat diabetes
    • Pigs were later used to produce massive amounts of insulin
    • But pig insulin isn't exactly the same - humans often had an immune reaction to it
    • Turns out there is one amino acid difference between pig and human insulin
    • 1970s, idea - take the human gene for insulin, put it in a plasmid, transform into bacteria
      • First problem: bacteria can't splice, so you have to put the gene in without introns
      • Do this by making a DNA copy of the mRNA that the introns have already been spliced out of
      • To do this, use the reverse transciptase enzyme from a retrovirus --> produced cDNA
      • Second problem: how do you find the right mRNA for making insulin?
      • Answer - you don't. Just take all the of the mRNA in the cell
      • Put each mRNA's cDNA into a different bacterium, then try to figure out which one produces insulin
      • Plasmid used to carry the DNA = vector
      • Cut the plasmid so that it is linear
      • Then attach each end of a cDNA to the plasmid, recreating a circle
      • DNA is cut using a class of enzymes - restriction endonucleases/enzymes
      • Proteins that identify a specific 6-base-pair (usually) stretch of DNA, cut it at that point for both strands
      • This specific stretch called the restriction site
      • Then, pasting the two ends together requires ligase to form phosphodiester bonds between cDNA and plasmid
      • Usually, put a gene for antibiotic resistance on the plasmid (e.g. ampicillin resistance)
      • So if we put in some ampicillin, only the bacteria that have the resistance gene will grow
      • This phenotype (ampicillin resistance) called the transformation marker
      • Creates a cDNA library - every member of a colony will have identical plasmids, all colonies, different cDNA
      • How do you find the colony that has the insulin cDNA? Use a DNA probe (piece of DNA used to find complementary DNA)
      • Basically you heat a stretch of DNA, which pulls the strands apart
      • If you let them cool, the strands will find each other again due to hydrogen bonds between the base pairs (hybridisation)
      • The longer the stretch of the complementary sequence, the greater the affinity to the other strand
      • So you synthesise a short stretch of radioactive DNA that is complementary to the sequence you're looking for
      • Then you expose the cDNA library to the probe - should stick to the right colony
      • You can then find the radioactive colony - that's the one you want
      • For insulin, deduce a sequence that codes for insulin and synthesise the corresponding sequence
      • Due to the degeneracy of the code, you have to use a mixture of all the possible corresponding sequences
      • Now you have found the human gene for insulin and put it in bacteria
      • Next step, make the bacteria express the gene - give it a promoter and terminator so RNA polymerase will recognise it as a gene
      • Back to restriction enzymes - each one leaves a different, characteristic overhang when it cuts
      • And the sticky ends will try to find each other
      • So to get the insulin gene expressed: cut it out of the library vector with a restriction enzyme
      • Cut the expression vector that has the promoter with the same enzyme
      • Then you mix the two, let the ends find each other; again, add ligase to make sure it stays
      • Problem: sticky ends usually just go back to each other, instead of combining the way we want them to
      • To make sure the genes combined the way we want them to, we can look at the plasmid DNA using gel electrophoresis
      • Because DNA is negatively charged, will move in an electric field towards the cathode (positively charged end)
      • If you force the DNA to move through a thick tangle of agarose fibres (strings of sugars), will move slowly
      • How slowly, depends on the size, so larger DNA moves more slowly
      • If we know which size insulin should be, then we can find the right DNA
      • Now, the bacterial RNA polymerase will recognise the insulin-making gene, will transcribe and translate it
      • So it will make insulin. You can also turn this on and off with lactose

8.2Gene mapping

  • Positional cloning - cystic fibrosis
    • Inherited, autosomnal recessive, respiratory infections - can't move mucus out of lungs
    • Purpose: find the specific chromosome and then the specific region the gene for cystic fibrosis lies
    • We could then identify and clone the gene
  • Problems:
    • Not that many human traits that show simple Mendelian inheritance against which you can map the disease trait
    • Solution: restriction fragment length polymorphisms
    • Variations in these sites will vary from one individual to the next ?
    • So, cut DNA with BamHI; you can then test for a particular RFLP using a PCR test (polymerase chain reaction)
    • PCR = method of producing large quantities of a short stretch of DNA using a DNA template
      • Idea: two DNA primers that flank the stretch, one complementary to each strand
      • Mix the primers with the single strand of template DNA, heat, allow hybridisation (annealing)
      • Then use DNA polymerase to synthesise complementary strands of each piece
      • Primers bind to the 3' stretch of the DNA to be amplified on each complementary strand
      • Initiate polymerisation in different directions; you now have two double strands
      • Repeat the process; you get a geometric increase in the amount of DNA between the primers
      • So basically this amplifies a small stretch of DNA enough that you can run it on the gel
    • Problem with PCR - heating the DNA destroys the polymerase in the process
      • Rather than add new polymerase each round, able to isolate polymerase from Thermus aquaticus
      • Bug that lives at 95 C in Yellowstone national park, obviously able to withstand high heats
    • So then you amplify the region of the chromosome that has the RFLP of interest
      • Then cut the amplified DNA with restriction enzymes, see if it cuts
      • If it cuts you have one large band; if not, two shorter bands
      • As a result, you know what alleles of the particular RFLP are present in that individual
      • Behave in a Mendelian fashion - you can distinguish homozygotes from heterozygotes (both long and short bands)
    • Where do you get RFLPs from in the first place? trial and error
      • Sequence the DNA of individuals, look for common polymorphisms that change the sequence of restriction sites
      • Look at people who have family histories of CF, look at their RFLPs, you can find RFLPs linked to the CF gene
      • You can then look at the human genome and figure out where on the chromosome the CF gene must be, roughly
      • But you still need to figure out which of the genes in that region is the CF gene
      • One way is through a cDNA library
      • Other way, computers - bioinformatics; look through the genome, find similar things etc
      • Find the difference between the CF gene in people with CF and people without
    • Sequencing DNA:
      • Synthesise a new strand of DNA using the DNA of interest as a template
      • But make it inefficient at one nucleotide, so that it stops every time that nucleotide is added
      • Then you run a gel, see how long the synthesised DNA is, tells you how far it travelled before it stopped
      • Which tells you where that nucleotide is
      • Do this for all four nucleotides ... you can figure out their sequence
      • To make it inefficient, mix in dideoxyNTP which lack the OH's necessary for polymerase to continue
      • Do four reactions, each with a different dideoxyNTP model, four mixtures of DNA with different lengths
    • Turns out, affected individuals have a small 3-base-pair deletion in an exon; removes single phenylalanine
    • The CF gene encodes a salt chloride channel - regulates chloride flow and thus controls salt balance
    • When it is defective, then the mucus that coats epithelial cells is too dry
    • Possible treatment: put correct CF gene on an adenovirus, infect CF patients with it, will replicate (not very successful)
    • CF has led to increased resistance against typhoid fever in the past, probably why it is still around