Difference between revisions of "How are the repeats determined?"
From WormBaseWiki
Jump to navigationJump to search(2 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
Repeats are determined in several ways. | Repeats are determined in several ways. | ||
− | <br> | + | *1) [[http://www.repeatmasker.org RepeatMasker]]<br> |
− | There are RepeatMasker libraries available for ''C.elegans ''and''C.briggsae'' are available from the [[http://www.sanger.ac.uk/Projects/C_elegans/REPEATS/ | + | There are RepeatMasker libraries available for ''C.elegans ''and''C.briggsae'' are available from the [[http://www.sanger.ac.uk/Projects/C_elegans/REPEATS/ Sanger Institute pages]]. These pages also have some description of the motifs identified. They can be found in the GFF files thus . . <br> |
+ | |||
+ | <span style="font-family: Courier New;">CHROMOSOME_III RepeatMasker repeat_region 9559 9734 837 . . Target "Motif:PALTA5_CE" 126 307 | ||
+ | </span> | ||
+ | |||
+ | |||
+ | |||
+ | *2) [[http://tandem.bu.edu/trf/trf.html Tandem Repeat Finder.]] by G. Benson | ||
+ | |||
+ | These can be found in the GFF files thus . . | ||
+ | |||
+ | <span style="font-family: Courier New;">CHROMOSOME_III tandem tandem_repeat 5079 5117 55 . . Note "3 copies of 14mer"</span> | ||
+ | |||
+ | <span style="font-family: Courier New;"></span> | ||
+ | |||
+ | |||
+ | |||
+ | *3) <span style="font-weight: bold;"></span>[[http://www.ncbi.nlm.nih.gov/pubmed/7514951?dopt=Abstract dust]]a low-complexity filter for nucleotide sequences (available from WS193)<br> | ||
+ | *4) inv - inverted repeat finding tool by R. Durbin (unpublished) | ||
+ | |||
+ | <span style="font-family: Courier New;">CHROMOSOME_III inverted inverted_repeat 9482 9734 69 . . Note "loop 877, 3 gaps</span>" | ||
+ | |||
+ | |||
+ | |||
+ | [[Category:User Guide]] | ||
+ | [[Category:Curation]] |
Latest revision as of 23:32, 13 August 2010
Repeats are determined in several ways.
- 1) [RepeatMasker]
There are RepeatMasker libraries available for C.elegans andC.briggsae are available from the [Sanger Institute pages]. These pages also have some description of the motifs identified. They can be found in the GFF files thus . .
CHROMOSOME_III RepeatMasker repeat_region 9559 9734 837 . . Target "Motif:PALTA5_CE" 126 307
- 2) [Tandem Repeat Finder.] by G. Benson
These can be found in the GFF files thus . .
CHROMOSOME_III tandem tandem_repeat 5079 5117 55 . . Note "3 copies of 14mer"
- 3) [dust]a low-complexity filter for nucleotide sequences (available from WS193)
- 4) inv - inverted repeat finding tool by R. Durbin (unpublished)
CHROMOSOME_III inverted inverted_repeat 9482 9734 69 . . Note "loop 877, 3 gaps"