<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dammit Jim! &#187; Biologist</title>
	<atom:link href="http://scott.sherrillmix.com/blog/category/biologist/feed/" rel="self" type="application/rss+xml" />
	<link>http://scott.sherrillmix.com/blog</link>
	<description>I'm a biologist not a...</description>
	<lastBuildDate>Mon, 06 Feb 2012 05:19:08 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>The Birds and the Bees of Leatherbacks</title>
		<link>http://scott.sherrillmix.com/blog/biologist/the-birds-and-the-bees-of-leatherbacks/</link>
		<comments>http://scott.sherrillmix.com/blog/biologist/the-birds-and-the-bees-of-leatherbacks/#comments</comments>
		<pubDate>Sat, 11 Jul 2009 03:23:36 +0000</pubDate>
		<dc:creator>ScottS-M</dc:creator>
				<category><![CDATA[Biologist]]></category>
		<category><![CDATA[Leatherback]]></category>
		<category><![CDATA[female]]></category>
		<category><![CDATA[male]]></category>
		<category><![CDATA[penis]]></category>
		<category><![CDATA[sea turtle]]></category>
		<category><![CDATA[tail]]></category>
		<category><![CDATA[turtle]]></category>

		<guid isPermaLink="false">http://scott.sherrillmix.com/blog/?p=488</guid>
		<description><![CDATA[When I was writing up the last post on the Great Turtle Race, I came across this Wikipedia page that has details on every Colbert report ever (what doesn&#8217;t Wikipedia have?). It included this quote on Stephen Colbert&#8217;s leatherback: Stephen is unhappy at the fact that Stephanie Colburtle The Turtle did not win The Great [...]]]></description>
			<content:encoded><![CDATA[<img src="/res/images/colburtle.jpg" alt="Colburtle Leatherback Turtle" class='right'/>
<p>When I was writing up the last post on the <a href="http://scott.sherrillmix.com/blog/biologist/great-turtle-race/">Great Turtle Race</a>, I came across this Wikipedia page that has <a href="http://en.wikipedia.org/wiki/List_of_The_Colbert_Report_episodes_(2007)">details on every Colbert report</a> ever (what doesn&#8217;t Wikipedia have?). It included this quote on Stephen Colbert&#8217;s leatherback:</p>

<blockquote><p>Stephen is unhappy at the fact that Stephanie Colburtle The Turtle did not win The Great Turtle Race, after being bested by another turtle named Billy. He claims Billy is a male, and demands a re-race. (After explaining that one can tell the sex of a turtle by the concavity of its plastron, Stephen says that he checks the plastron on &#8220;all [his] dates, and if it&#8217;s not concave, [he is] outta there.&#8221; However, a concave plastron denotes a male turtle.)</p></blockquote>

<p>Now I&#8217;m not totally sure the concave plastron bit works with leatherbacks since they&#8217;re more barrel-shaped than turtle-shaped but I guess it&#8217;s possible. But on the topic, I just thought I&#8217;d share a couple tips for determining leatherback sex.</p>
<a href="http://www.seaturtle.org/cgi-bin/imagelib/index.pl?photo=2469"><img src="/res/images/green_turtle_tail.jpg" alt="Male green turtle tail by Daha DIEW et Alain GIBUDI" class="right"/></a>
<p>First, is it on a beach? If so, it&#8217;s female. Healthy male leatherbacks never return to land after their initial crawl from the nest to the ocean. That makes research programs that catch turtles at sea the only way to look at male leatherbacks.</p>
<p>Second, does it have a long tail that trails well behind the shell? Then it&#8217;s a male. Leatherbacks (and other sea turtles) store their penis in their tail. The tails of female turtles barely extend past their shell. The tails of male turtles, shall we say, hang low and wobble to and fro. I couldn&#8217;t dig up a picture of a male leatherback but here&#8217;s a picture of a male green sea turtle tail (from <a href="http://www.seaturtle.org/cgi-bin/imagelib/index.pl?photo=2469">seaturtle.org</a> courtesy of Daha Diew and Alain Gibudi) that should give an idea (that&#8217;s its rear flippers in the left edge of the picture).</p>
<p>And now you know.</p>]]></content:encoded>
			<wfw:commentRss>http://scott.sherrillmix.com/blog/biologist/the-birds-and-the-bees-of-leatherbacks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Great Turtle Race</title>
		<link>http://scott.sherrillmix.com/blog/biologist/great-turtle-race/</link>
		<comments>http://scott.sherrillmix.com/blog/biologist/great-turtle-race/#comments</comments>
		<pubDate>Sat, 30 May 2009 06:27:03 +0000</pubDate>
		<dc:creator>ScottS-M</dc:creator>
				<category><![CDATA[Biologist]]></category>
		<category><![CDATA[Leatherback]]></category>
		<category><![CDATA[great turtle race]]></category>
		<category><![CDATA[national geographic]]></category>
		<category><![CDATA[satellite telemetry]]></category>
		<category><![CDATA[tracking]]></category>

		<guid isPermaLink="false">http://scott.sherrillmix.com/blog/?p=458</guid>
		<description><![CDATA[I&#8217;m a bit late on this one (I don&#8217;t know how I managed to miss it since I had to put the data together) but National Geographic and Conservation International had a Great Turtle Race with a bunch of leatherback turtles tagged by my old adviser. They took data from turtle tracked from Nova Scotia [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m a bit late on this one (I don&#8217;t know how I managed to miss it since I had to put the data together) but National Geographic and Conservation International had a <a href="http://www.conservation.org/great_turtle_race/pages/main.aspx">Great Turtle Race</a> with a bunch of leatherback turtles tagged by my old adviser. They took data from turtle tracked from Nova Scotia to South America and had a big two-week event watching which turtle reached the Caribbean first. They have a pretty <a href="http://animals.nationalgeographic.com/greatturtlerace-map/">cool animation</a> of the satellite tracking (although of course not quite as good as <a href="http://scott.sherrillmix.com/blog/biologist/when-do-leatherback-turtles-migrate-south/">mine</a>) and some cute <a href="http://kidsblogs.nationalgeographic.com/greenscene/turtles.html">leatherback artwork</a> (complete with leathery back instead of shell, although why are they green?).</p>
<a href="http://animals.nationalgeographic.com/greatturtlerace-map/"><img src="/res/images/greatTurtleRace.png" alt="Flash animation of Great Turtle Race" class="center"/></a>
<a href="http://kids.nationalgeographic.com/Games/ActionGames/Great-turtle-race"><img src="/res/images/greatTurtleRace_game.png" alt="Leatherback turtle game" class="right"/></a>
<p>That site also has the first <a href="http://kids.nationalgeographic.com/Games/ActionGames/Great-turtle-race">leatherback game</a> I&#8217;ve ever seen. The artist did a really good job since the view is pretty much identical to the view we get from a shoulder mounted turtleCam. Unfortunately, the game turtle handles like a tank which really doesn&#8217;t do justice to the maneuvering ability of leatherbacks. They&#8217;re huge animals but in the water they&#8217;re really quite graceful and they can turn on a dime (as I quickly found out when we were trying to catch them).</p>
<a href="http://kidsblogs.nationalgeographic.com/greenscene/turtles.html"><img src="/res/images/greatTurtleRace_backspacer.png" alt="Backspacer leatherback by Chris Rooney" class="left"/></a>
<p>Anyway, it looks like the turtle named Backspacer (it&#8217;s weird to see all the interesting names since we always call the turtles by their tag ID number), sponsored by Pearl Jam, yes that <a href="http://www.pearljam.com/news/great-turtle-race-update-backspacer-wins-loomstate-responds-pearl-jam-audience-favorite-challen">Pearl Jam</a>, won the race.  Turtle Cali won the diving portion of the race and received an <a href="http://deepseanews.com/2009/05/world-champion-leatherback-wins-iron-turtle-award/">Iron Turtle Award</a>.  Here&#8217;s a nice <a href="http://blog.conservation.org/2009/04/where-are-they-now-the-true-stories-of-each-racing-turtle/">post-race summary</a> and also Olympic swimmer (and turtle coach) <a href="http://blog.conservation.org/2009/04/coach-jason-lezak-wraps-up-the-race/">Jason Lezak&#8217;s take on it</a>.  It&#8217;s great to see so much public interest in leatherback turtle tracking and National Geographic and Conservation International did a great job promoting and running the event.</p>
]]></content:encoded>
			<wfw:commentRss>http://scott.sherrillmix.com/blog/biologist/great-turtle-race/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Counting Q20 Bases in a .qual File</title>
		<link>http://scott.sherrillmix.com/blog/biologist/counting-q20-bases-in-a-qual-file/</link>
		<comments>http://scott.sherrillmix.com/blog/biologist/counting-q20-bases-in-a-qual-file/#comments</comments>
		<pubDate>Wed, 18 Mar 2009 16:43:17 +0000</pubDate>
		<dc:creator>ScottS-M</dc:creator>
				<category><![CDATA[Bash/UNIX]]></category>
		<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Biologist]]></category>
		<category><![CDATA[Programmer]]></category>
		<category><![CDATA[count]]></category>
		<category><![CDATA[grep]]></category>
		<category><![CDATA[Q20]]></category>
		<category><![CDATA[qual]]></category>
		<category><![CDATA[qualities]]></category>
		<category><![CDATA[sed]]></category>
		<category><![CDATA[sequencing]]></category>

		<guid isPermaLink="false">http://scott.sherrillmix.com/blog/?p=422</guid>
		<description><![CDATA[I sometimes get asked to count the number of bases with qualities greater than or equal to 20 in a quality file. I&#8217;m not entirely sure this is all that good a metric with 454 sequencing but that&#8217;s another story. It always takes me a minute or two to come up with the right Unix [...]]]></description>
			<content:encoded><![CDATA[<p>I sometimes get asked to count the number of bases with qualities greater than or equal to 20 in a quality file. I&#8217;m not entirely sure this is all that good a metric with 454 sequencing but that&#8217;s another story. It always takes me a minute or two to come up with the right Unix commands to do it so I&#8217;m going to post it here so I remember (and maybe save someone else a couple minutes).</p>
<p><code>
cat *qual|grep &#039;^[^&gt;]&#039;|sed &#039;s/ /\n/g&#039;|grep -c [234][0-9]
</code></p>
<p>This is very quick and dirty (just removing lines starting with &#8220;&gt;&#8221;, replacing spaces with newlines and counting the resulting lines with quals 20-40) but it seems to work ok for me. Also yes I know it&#8217;s stupid to cat to a grep but I often replace the cat with head for testing. And I&#8217;m sure you could do it in a single awk or sed step but it gets done in a minute or two for several hundred million bases so I haven&#8217;t really been motivated to change it.</p>
]]></content:encoded>
			<wfw:commentRss>http://scott.sherrillmix.com/blog/biologist/counting-q20-bases-in-a-qual-file/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Converting Between Gapped and NonGapped DNA Coordinates in R</title>
		<link>http://scott.sherrillmix.com/blog/biologist/converting-between-gapped-and-nongapped-dna-coordinates-in-r/</link>
		<comments>http://scott.sherrillmix.com/blog/biologist/converting-between-gapped-and-nongapped-dna-coordinates-in-r/#comments</comments>
		<pubDate>Tue, 17 Feb 2009 09:27:17 +0000</pubDate>
		<dc:creator>ScottS-M</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Biologist]]></category>
		<category><![CDATA[Programmer]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[alignment]]></category>
		<category><![CDATA[convert]]></category>
		<category><![CDATA[DNA]]></category>
		<category><![CDATA[gapped]]></category>
		<category><![CDATA[gaps]]></category>
		<category><![CDATA[sequence]]></category>

		<guid isPermaLink="false">http://scott.sherrillmix.com/blog/?p=393</guid>
		<description><![CDATA[I've been needing to convert coordinates in gapped DNA alignments back and forth to coordinates in the nongapped sequence a lot recently. For example, the T in AC--TGA--A is in the 5th position but in the gapless sequence ACTGAA it is 3rd. My first few tries (counting chars in regex'd substrings) at programming up a [...]]]></description>
			<content:encoded><![CDATA[<p>I've been needing to convert coordinates in gapped DNA alignments back and forth to coordinates in the nongapped sequence a lot recently. For example, the T in AC--TGA--A is in the 5th position but in the gapless sequence ACTGAA it is 3rd. My first few tries (counting chars in regex'd substrings) at programming up a decent converters ended up not working very well with large datasets. Since I had to do it with 50,000 positions in a sequence with 2 million letters, I was running into a little trouble. But it ended up actually being a pretty easy problem in R so I thought I'd post it up in case anybody else is running into something similar. Also it's a pretty good example why programming in R can be fun.</p>

<div class="syntax_hilite"><span class="langName">R:</span><br /><div id="r-3">
<div class="r"><ol><li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">gap2NoGap&lt;-<span style="color: #a020f0;">function</span><span style="color: #66cc66;">&#40;</span>gapSeq,coords<span style="color: #66cc66;">&#41;</span><span style="color: #66cc66;">&#123;</span></div></li>
<li style="font-weight: bold;color:#26536A;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;gapSeqSplit&lt;-strsplit<span style="color: #66cc66;">&#40;</span>gapSeq,''<span style="color: #66cc66;">&#41;</span><span style="color: #66cc66;">&#91;</span><span style="color: #66cc66;">&#91;</span><span style="color: #cc66cc;color:#800000;">1</span><span style="color: #66cc66;">&#93;</span><span style="color: #66cc66;">&#93;</span></div></li>
<li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;nonDash&lt;-gapSeqSplit!='-'</div></li>
<li style="font-weight: bold;color:#26536A;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;newCoords&lt;-cumsum<span style="color: #66cc66;">&#40;</span>nonDash<span style="color: #66cc66;">&#41;</span></div></li>
<li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;<span style="color: #a020f0;">return</span><span style="color: #66cc66;">&#40;</span>newCoords<span style="color: #66cc66;">&#91;</span>coords<span style="color: #66cc66;">&#93;</span><span style="color: #66cc66;">&#41;</span></div></li>
<li style="font-weight: bold;color:#26536A;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;"><span style="color: #66cc66;">&#125;</span> </div></li></ol></div>
</div></div><br />
<p>So the function takes a gapped sequence and the gapped coordinates. It splits the gapped sequence into an array of single characters and finds which ones are not -'s. It then takes the cumulative sum at each element in the array (TRUE evaluates as 1 and FALSE as 0). This gives an array where each element gives the new gap-free coordinates. So the function just returns the appropriate new coordinate for each old coordinate.</p>
<div class="syntax_hilite"><span class="langName">R:</span><br /><div id="r-4">
<div class="r"><ol><li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">noGap2Gap&lt;-<span style="color: #a020f0;">function</span><span style="color: #66cc66;">&#40;</span>gapSeq,coords<span style="color: #66cc66;">&#41;</span><span style="color: #66cc66;">&#123;</span></div></li>
<li style="font-weight: bold;color:#26536A;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;gapSeqSplit&lt;-strsplit<span style="color: #66cc66;">&#40;</span>gapSeq,''<span style="color: #66cc66;">&#41;</span><span style="color: #66cc66;">&#91;</span><span style="color: #66cc66;">&#91;</span><span style="color: #cc66cc;color:#800000;">1</span><span style="color: #66cc66;">&#93;</span><span style="color: #66cc66;">&#93;</span></div></li>
<li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;newCoords&lt;-which<span style="color: #66cc66;">&#40;</span>gapSeqSplit!='-'<span style="color: #66cc66;">&#41;</span></div></li>
<li style="font-weight: bold;color:#26536A;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;">&nbsp; &nbsp;<span style="color: #a020f0;">return</span><span style="color: #66cc66;">&#40;</span>newCoords<span style="color: #66cc66;">&#91;</span>coords<span style="color: #66cc66;">&#93;</span><span style="color: #66cc66;">&#41;</span></div></li>
<li style="font-family: 'Courier New', Courier, monospace; color: black; font-weight: normal; font-style: normal;color:#3A6A8B;"><div style="font-family: 'Courier New', Courier, monospace; font-weight: normal;"><span style="color: #66cc66;">&#125;</span> </div></li></ol></div>
</div></div><br />
<p>This opposite conversion function again takes the gapped sequence and appropriately the nongapped coordinates. It splits the gapped sequence into an array of single characters and finds the indices of the characters which are not -'s. Since we only stored the indices of nongap characters, this array gives the gapped coordinates for each nongap letter.  So again the function can just returns the appropriate new coordinate for each old coordinate.</p>
<p>These ended up being pretty elegant in R (once I finally figured them out).</p>
]]></content:encoded>
			<wfw:commentRss>http://scott.sherrillmix.com/blog/biologist/converting-between-gapped-and-nongapped-dna-coordinates-in-r/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Functional Metagenomics: Sequence Everything and Let DNA Sort The Functions Out</title>
		<link>http://scott.sherrillmix.com/blog/biologist/functional-metagenomics-sequence-everything-and-let-dna-sort-the-functions-out/</link>
		<comments>http://scott.sherrillmix.com/blog/biologist/functional-metagenomics-sequence-everything-and-let-dna-sort-the-functions-out/#comments</comments>
		<pubDate>Mon, 27 Oct 2008 03:57:55 +0000</pubDate>
		<dc:creator>ScottS-M</dc:creator>
				<category><![CDATA[Biologist]]></category>
		<category><![CDATA[Statistician]]></category>
		<category><![CDATA[bacteria]]></category>
		<category><![CDATA[DNA]]></category>
		<category><![CDATA[DNA sequencing]]></category>
		<category><![CDATA[metagenomics]]></category>
		<category><![CDATA[pyrosequencing]]></category>
		<category><![CDATA[virus]]></category>

		<guid isPermaLink="false">http://scott.sherrillmix.com/blog/?p=163</guid>
		<description><![CDATA[One of the cool things you can do with the high throughput DNA analysis of pyrosequencing, is to collect a sample from the environment, isolate the DNA from everything in it and sequence it. Then you can match the DNA up with known sequences and see what sort of microbes you had. Dinsdale and a [...]]]></description>
			<content:encoded><![CDATA[<a href="http://www.researchblogging.org"><img alt="ResearchBlogging.org" src="http://www.researchblogging.org/images/rbicons/ResearchBlogging-Medium-Trans.png" width="80" height="50" class="left"/></a>
<p>One of the cool things you can do with the high throughput DNA analysis of <a href="http://scott.sherrillmix.com/blog/biologist/can-you-sequence-a-bacterias-entire-genome-overnight/">pyrosequencing</a>, is to collect a sample from the environment, isolate the DNA from everything in it and sequence it. Then you can match the DNA up with known sequences and see what sort of microbes you had. Dinsdale and a bunch of coauthors collected the data from a bunch of such studies. They managed to find 45 bacterial samples and 42 viral samples from 9 broad environmental classifications. You can see all the different samples the authors pooled together (circles microbial and squares viral).</p>
<img src="/res/images/metagenomics_locations.png" alt="Locations of metagenomic samples from Dinsdale et al." class="center"/>
<p>The interesting thing about this study was that instead of looking at the taxonomy of the critters as usual, they looked at the <em>function</em> of the genes. By simply looking at what the genes do, the researchers hoped to get a feel for what activities were going on in that environment without necessarily having to identify the species of the bacteria and viruses. To do this, they fed their 14.5 million sequences (pyrosequencing sure can generate data) into the <a href="http://www.theseed.org/wiki/Main_Page">SEED database</a>, a big collection of genes which have been assigned to functions (for example membrane transport or sulphur metabolism) by experts. They were able to match 1 million of the bacterial and 500,000 of the viral sequences to previously identified gene functions.</p>
<p>It might seem odd that they would look at viral DNA since viruses are rather simple and have only a few basic genes. But the researchers were actually looking at bacterial genetic sequences being carried inside viruses. This of course brings up the question of what bacterial DNA is doing inside viruses. It turns out there are a lot of bacteriophage viruses that like to infect bacteria and sometimes these viruses capture some of the DNA of their bacterial hosts and carry it to their next host. Looking at the bacterial DNA present in a viral population gives an interesting look at what types of genes are being passed around between individual bacteria (and even between bacterial species).</p>
<p>So here are the high level classifications of the function of the genes they found for each environment.</p>
<img src="/res/images/metagenomics_percent_functions.png" alt="Percentages of gene function of bacterial and viral gene function from Dinsdale et al." title="Percentages of gene function of bacterial and viral gene function from Dinsdale et al." class="center"/>
<p>It's pretty cool that the viruses were carrying around so much of a variety of bacterial DNA. The authors suggest that motility genes coding for things like flagella and cilia (which could help the bacterial host spread the virus further) were enriched in the viral samples but it seems a bit hard to say that for certain without a bit more analysis.</p>
<p>A useful way to look at huge masses of data, like their 1.5 million matches, is to try and reduce all the different counts in the functional categories into a couple of condensed variables. This can be seen in the next couple plots. They could use a little explaining. Bacterial sequences are on top and viral sequences on the bottom. Lines show how the various functional categories have been condensed into the x and y variables. For example, samples that contained lots of genes for making cell walls will tend to be at the top of the plot in the bacterial samples and tend not to have many genes for respiration.</p>
<img src="/res/images/metagenomics_cdf.png" alt="Canonical discriminant function analysis of bacterial and viral gene function from Dinsdale et al." title="Canonical discriminant function analysis of bacterial and viral gene function from Dinsdale et al." class="center"/>
<p>It's pretty cool to see how the various environments clustered with other samples from the same environment. For example, all the yellow diamond fish farm samples ended up on the right side of the bacteria graphs even though they were sampled independently. It appears that functions seem to correlate with environmental conditions. For example, the fish food at the fish farms contained a lot of sulfur supplements and the bacteria from those samples were rich in sulfur metabolism genes and the bacteria from corals contained many different respiration genes to deal with the highly variable oxygen concentrations found there. Dinsdale and her coauthors go so far as to suggest that gene function may provide a better indicator of environment than the taxonomy of the bacteria present.</p>
<p>The paper did have a little trouble in the math in one part but the authors already have a correction in for it so it's really not worth worrying about. Overall, it was a pretty interesting story and a good example of stuff to do with a sequencing machine (also it must have taken a good bit of work to collect all that data together from all those authors).</p>

<h3>References</h3>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&#038;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&#038;rft.jtitle=Nature&#038;rft.id=info:DOI/10.1038%2Fnature06810&#038;rft.atitle=Functional+metagenomic+profiling+of+nine+biomes&#038;rft.date=2008&#038;rft.volume=452&#038;rft.issue=7187&#038;rft.spage=629&#038;rft.epage=632&#038;rft.artnum=http%3A%2F%2Fwww.nature.com%2Fdoifinder%2F10.1038%2Fnature06810&#038;rft.au=Elizabeth+A.+Dinsdale&#038;rft.au=Robert+A.+Edwards&#038;rft.au=Dana+Hall&#038;rft.au=Florent+Angly&#038;rft.au=Mya+Breitbart&#038;rft.au=Jennifer+M.+Brulc&#038;rft.au=Mike+Furlan&#038;rft.au=Christelle+Desnues&#038;rft.au=Matthew+Haynes&#038;rft.au=Linlin+Li&#038;rft.au=Lauren+McDaniel&#038;rft.au=Mary+Ann+Moran&#038;rft.au=Karen+E.+Nelson&#038;rft.au=Christina+Nilsson&#038;rft.au=Robert+Olson&#038;rft.au=John+Paul&#038;rft.au=Beltran+Rodriguez+Brito&#038;rft.au=Yijun+Ruan&#038;rft.au=Brandon+K.+Swan&#038;rft.au=Rick+Stevens&#038;rft.au=David+L.+Valentine&#038;rft.au=Rebecca+Vega+Thurber&#038;rft.au=Linda+Wegley&#038;rft.au=Bryan+A.+White&#038;rft.au=Forest+Rohwer&#038;bpr3.included=1&#038;bpr3.tags=Biology">Elizabeth A. Dinsdale, Robert A. Edwards, Dana Hall, Florent Angly, Mya Breitbart, Jennifer M. Brulc, Mike Furlan, Christelle Desnues, Matthew Haynes, Linlin Li, Lauren McDaniel, Mary Ann Moran, Karen E. Nelson, Christina Nilsson, Robert Olson, John Paul, Beltran Rodriguez Brito, Yijun Ruan, Brandon K. Swan, Rick Stevens, David L. Valentine, Rebecca Vega Thurber, Linda Wegley, Bryan A. White, Forest Rohwer (2008). <cite>Functional metagenomic profiling of nine biomes</cite> Nature, 452 (7187), 629-632 DOI: <a rev="review" href="http://dx.doi.org/10.1038/nature06810">10.1038/nature06810</a></span></p>
]]></content:encoded>
			<wfw:commentRss>http://scott.sherrillmix.com/blog/biologist/functional-metagenomics-sequence-everything-and-let-dna-sort-the-functions-out/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

