<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="http://wiki.genedrift.org/lib/exe/css.php?s=feed" type="text/css"?>
<rdf:RDF
    xmlns="http://purl.org/rss/1.0/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel rdf:about="http://wiki.genedrift.org/feed.php">
        <title>GeneDrift Wiki</title>
        <description></description>
        <link>http://wiki.genedrift.org/</link>
        <image rdf:resource="http://wiki.genedrift.org/lib/images/favicon.ico" />
       <dc:date>2012-05-19T09:10:46-04:00</dc:date>
        <items>
            <rdf:Seq>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=beginning_python_for_bioinformatics&amp;rev=1243006163&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=bioinformatics_workshop_2010_-_department_of_pathology_and_molecular_medicine&amp;rev=1282138004&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=gd_wiki&amp;rev=1281051594&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part1&amp;rev=1264359087&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part10&amp;rev=1242317692&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part11&amp;rev=1259955530&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part12&amp;rev=1242834101&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part13&amp;rev=1242853116&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part14&amp;rev=1242919422&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part15&amp;rev=1243006104&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part16&amp;rev=1243006578&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part2&amp;rev=1242056287&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part3&amp;rev=1242057020&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part4&amp;rev=1242315514&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part5&amp;rev=1242138582&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part6&amp;rev=1265883680&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part7&amp;rev=1242833675&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part8&amp;rev=1242225368&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=part9&amp;rev=1242316151&amp;do=diff"/>
                <rdf:li rdf:resource="http://wiki.genedrift.org/doku.php?id=start&amp;rev=1240941445&amp;do=diff"/>
            </rdf:Seq>
        </items>
    </channel>
    <image rdf:about="http://wiki.genedrift.org/lib/images/favicon.ico">
        <title>GeneDrift Wiki</title>
        <link>http://wiki.genedrift.org/</link>
        <url>http://wiki.genedrift.org/lib/images/favicon.ico</url>
    </image>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=beginning_python_for_bioinformatics&amp;rev=1243006163&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-22T11:29:23-04:00</dc:date>
        <title>beginning_python_for_bioinformatics</title>
        <link>http://wiki.genedrift.org/doku.php?id=beginning_python_for_bioinformatics&amp;rev=1243006163&amp;do=diff</link>
        <description>The posts from Beginning Python for Bioinformatics are and will be included here, allowing for a better experience when reading the entries.



Beginning the begin


This website uses as a premise the book:

Beginning Perl for Bioinformatics by James Tisdal</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=bioinformatics_workshop_2010_-_department_of_pathology_and_molecular_medicine&amp;rev=1282138004&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-08-18T09:26:44-04:00</dc:date>
        <title>bioinformatics_workshop_2010_-_department_of_pathology_and_molecular_medicine</title>
        <link>http://wiki.genedrift.org/doku.php?id=bioinformatics_workshop_2010_-_department_of_pathology_and_molecular_medicine&amp;rev=1282138004&amp;do=diff</link>
        <description>Thursday August 5th

Topics covered:


Basic Linux/Unix commands: cd - ls - mkdir - wc - grep


Introduction to the BASH shell - online book

Simple for loops in BASH 

Simple FASTA file manipulation in BASH


BioMart -  Martview

Blast+ -  Blast book, based on the legacy version but still relevant</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=gd_wiki&amp;rev=1281051594&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-08-05T19:39:54-04:00</dc:date>
        <title>gd_wiki</title>
        <link>http://wiki.genedrift.org/doku.php?id=gd_wiki&amp;rev=1281051594&amp;do=diff</link>
        <description>This wiki will contain entries from  Beginning Python for Bioinformatics and some information about software projects.

 Beginning Python for Bioinformatics

Software

Bioinformatics Workshop 2010 - Department of Pathology and Molecular Medicine</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part1&amp;rev=1264359087&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-01-24T13:51:27-04:00</dc:date>
        <title>part1</title>
        <link>http://wiki.genedrift.org/doku.php?id=part1&amp;rev=1264359087&amp;do=diff</link>
        <description>Part 1

Hands on code: Sequences and strings 1

As pointed in Beginning Perl for Bioinformatics, a large percentage of bioinformatics procedures deals with strings, especially DNA and amino acids sequence data. As is largely known DNA is composed of four different nucleotides: A, C, T and G and proteins can contain up to 20 amino acids. Each one of these elements have one letter of the alphabet assigned to them. In the DNA case some letters represent one or more nucleotides that can be present a…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part10&amp;rev=1242317692&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-14T12:14:52-04:00</dc:date>
        <title>part10</title>
        <link>http://wiki.genedrift.org/doku.php?id=part10&amp;rev=1242317692&amp;do=diff</link>
        <description>&quot;Cutting&quot; chromosomes with Python: a fast approach

Last couple of posts we started with some functional programming aspects of Python. I was away last week and couldn't create anything related to FP in the meantime, so I decided to post about a quick way to “cut”, or extract segments, from chromosomes stored as FASTA files.</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part11&amp;rev=1259955530&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-12-04T14:38:50-04:00</dc:date>
        <title>part11</title>
        <link>http://wiki.genedrift.org/doku.php?id=part11&amp;rev=1259955530&amp;do=diff</link>
        <description>Python sets

This time we will check how set works in Python. set is another object type available in Python (version 2.3 and up) that brings a lot of features to the language.

From the Python Library Reference: “A set object is an unordered collection of immutable values. Common uses include membership testing, removing duplicates from a sequence, and computing mathematical operations such as intersection, union, difference, and symmetric difference.” Yep, all these are possible with set, and …</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part12&amp;rev=1242834101&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-20T11:41:41-04:00</dc:date>
        <title>part12</title>
        <link>http://wiki.genedrift.org/doku.php?id=part12&amp;rev=1242834101&amp;do=diff</link>
        <description>Generating reverse complement of DNA sequences


As shown in the GenBank DNA parser script, it is really useful to have the ability to get the reverse complement of some DNA sequences. The reverse complement of a 5'-3' DNA sequence is on its complementary strand. Using our fasta module it is easy to implement a function to generate the antiparallel sequence</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part13&amp;rev=1242853116&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-20T16:58:36-04:00</dc:date>
        <title>part13</title>
        <link>http://wiki.genedrift.org/doku.php?id=part13&amp;rev=1242853116&amp;do=diff</link>
        <description>Python sets and intersections

This time we will see a different example of set usage that includes some methods available for this type of objects. The initial problem was how to determine which genomes are represented in protein/DNA clusters obtained with  CD-HIT. Basically CD-HIT uses a multiple fasta file to generate clusters of proteins/DNA using their sequence identity. It clusters the sequences, keeping their fasta title and assigning an ID for each cluster obtained. We won't see how is C…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part14&amp;rev=1242919422&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-21T11:23:42-04:00</dc:date>
        <title>part14</title>
        <link>http://wiki.genedrift.org/doku.php?id=part14&amp;rev=1242919422&amp;do=diff</link>
        <description>1

This part is based on the conversion of some small C++ programs I developed in the past. These small programs (I call them modules because they were part of a larger application) were used to count motifs, short nucleotide &lt;i&gt;words&lt;/i&gt; up to 10-12 base pairs, and then calculate statistical overrepresentation of these words by comparing a foreground set of DNA sequences against a background set.</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part15&amp;rev=1243006104&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-22T11:28:24-04:00</dc:date>
        <title>part15</title>
        <link>http://wiki.genedrift.org/doku.php?id=part15&amp;rev=1243006104&amp;do=diff</link>
        <description>1


We created a very elegant script in Python that efficiently counts words in FASTA sequences and then using a basic statistical method, calculates the significance of each word and output the overrepresented ones.

Our script used a little bit less than 50 lines, and if you include the imported fasta module, it won't top 100. But the number of lines is not important. The efficiency, clarity and speed are key here. At the same time, running a script from the command line is not something every…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part16&amp;rev=1243006578&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-22T11:36:18-04:00</dc:date>
        <title>part16</title>
        <link>http://wiki.genedrift.org/doku.php?id=part16&amp;rev=1243006578&amp;do=diff</link>
        <description>1


Let's check some database topics in Python. I was asked to create a simple database to organize wet-lab stuff. No relationships needs, no relational tables required. Just a simple table with determined columns, and a nice GUI to go with it so people can edit, search and use.</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part2&amp;rev=1242056287&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-11T11:38:07-04:00</dc:date>
        <title>part2</title>
        <link>http://wiki.genedrift.org/doku.php?id=part2&amp;rev=1242056287&amp;do=diff</link>
        <description>Flow control in Python

We start following the fifth chapter of BPB. The first item is about flow control and code layout, which are very relevant for our tutorial. We already introduced briefly both aspects in past entries on the site, but it is always good to check. As the book, I will start with flow control.</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part3&amp;rev=1242057020&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-11T11:50:20-04:00</dc:date>
        <title>part3</title>
        <link>http://wiki.genedrift.org/doku.php?id=part3&amp;rev=1242057020&amp;do=diff</link>
        <description>Debugging in Python

Beginning the third section in our tutorial/guide, we are going to see the chapter six of BPB. This chapter discusses the topics of creating subroutines (in Python's case functions) and debugging the code.

We are going to start by the end. In Python, code debugging can be done as in any other programming language: Perl has pdb, C/C++ has gdb, etc. Python also has a pdb module that can be imported and run to check for errors in your code. Using this command line:</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part4&amp;rev=1242315514&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-14T11:38:34-04:00</dc:date>
        <title>part4</title>
        <link>http://wiki.genedrift.org/doku.php?id=part4&amp;rev=1242315514&amp;do=diff</link>
        <description>Randomization

We start section 4 with a very short introduction to randomization. Chapter 7 of BPB discuss the use of randomization to obtain mutations in DNA and protein sequences. The example given in the book is at the same time simple and interesting, as it creates a paragraph from random selections of nouns, adjectives, verbs and other grammar elements. Here we are going to to create a very (stress on very) simple dice game, where each time you run the script it will throw two dices for yo…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part5&amp;rev=1242138582&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-12T10:29:42-04:00</dc:date>
        <title>part5</title>
        <link>http://wiki.genedrift.org/doku.php?id=part5&amp;rev=1242138582&amp;do=diff</link>
        <description>Genetic code

The Python dictionary data-type is like hash in Perl. It is a very similar structure, where each element in the variable is composed of two values, more specifically a key-value pair. This is the ideal data type to store the genetic code. As you might know the genetic code governs the translation of DNA into proteins, where each codon (3 bases or nucleotides in the DNA sequence) correspond to an amino acid in the protein sequence. So for every sequence of 3 nucleotides (key) will r…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part6&amp;rev=1265883680&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-02-11T05:21:20-04:00</dc:date>
        <title>part6</title>
        <link>http://wiki.genedrift.org/doku.php?id=part6&amp;rev=1265883680&amp;do=diff</link>
        <description>Finding motifs: IUPAC and RegEx, an overview

For anyone also following the book there will be a jump at the end of chapter 8, so we are not touching the final script that deals with different reading frames here. We are going straight, or almost, to another take of Regular Expressions. We are going to check some aspects of restriction enzymes, but first we are going to touch base with motif finding in DNA sequences.</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part7&amp;rev=1242833675&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-20T11:34:35-04:00</dc:date>
        <title>part7</title>
        <link>http://wiki.genedrift.org/doku.php?id=part7&amp;rev=1242833675&amp;do=diff</link>
        <description>GenBank files: take one

This part deals with the manipulation of  GenBank files. These files are used by NCBI to store information about RNA, DNA and protein sequences. It is usually composed of an annotation section, that gives information about the sequence present in the particular file. I won't spend much time explaining the GenBank format, because it is not the goal of the site. The perl book has some good explanation about it and you can also find more information  here. Also, we are goin…</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part8&amp;rev=1242225368&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-13T10:36:08-04:00</dc:date>
        <title>part8</title>
        <link>http://wiki.genedrift.org/doku.php?id=part8&amp;rev=1242225368&amp;do=diff</link>
        <description>Splitting a multiple sequence FASTA file

We will see the easiest way to split a FASTA file. There are many ways to name each file after the split, and we will use a simple &lt;i&gt;original filename&lt;/i&gt; + number. We also going to use the FASTA class and functions defined previously in  fasta.py. To refresh our memories here is the code</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=part9&amp;rev=1242316151&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-05-14T11:49:11-04:00</dc:date>
        <title>part9</title>
        <link>http://wiki.genedrift.org/doku.php?id=part9&amp;rev=1242316151&amp;do=diff</link>
        <description>Functional programming in Python: using map


First we need to define what is functional programming. Quoting  Wikipedia: “is a programming paradigm that treats computation as the evaluation of mathematical functions and avoids state and mutable data. It emphasizes the application of functions, in contrast with the imperative programming style that emphasizes changes in state.”</description>
    </item>
    <item rdf:about="http://wiki.genedrift.org/doku.php?id=start&amp;rev=1240941445&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2009-04-28T13:57:25-04:00</dc:date>
        <title>start</title>
        <link>http://wiki.genedrift.org/doku.php?id=start&amp;rev=1240941445&amp;do=diff</link>
        <description>This pages will contain information regarding the Beginning Python for Bioinformatics blog and bioinformatics software and projects. To see more click the links below:

Beginning Python for Bioinformatics

Software

Projects</description>
    </item>
</rdf:RDF>

