<?xml version="1.0" encoding="ISO-8859-1" ?>
<rss version="2.0">
<channel>
	<title>Bioinformatics and BioStatistics Method Forum</title>
	<description></description>
	<link>http://www.protocol-online.org/forums</link>
	<pubDate>Fri, 10 Feb 2012 09:48:01 +0000</pubDate>
	<ttl>360</ttl>
	<item>
		<title>Determining the N- and C-terminal regions in a protein</title>
		<link>http://www.protocol-online.org/forums/topic/24370-determining-the-n-and-c-terminal-regions-in-a-protein/</link>
		<description><![CDATA[Dear all,<br />
 <br />
So, something's been bugging me all night. I'm doing a literature review on the polyester synthase and a group of related proteins. Coming to the molecular aspect, I encounter the terms "N-terminal / C-terminal regions" often.<br />
 <br />
How do we (or in the near future, I) determine / decide where the N-terminal or C-terminal spans?<br />
 <br />
The first thought that came into my mind was a multiple alignment with the same protein in other organisms. But that&#8217;s only useful for those having a conserved N or C  regions. What more, this particular group of protein has a &#8220;non-conserved N-terminal region&#8221; while the alpha/beta hydrolase fold domain is conserved in part of the C-terminal (see below).<br />
 <br />
1......98.............................................589<br />
NNNNNC<span style='color: #0000FF'><strong class='bbc'>C</strong></span>CCC<span style='color: #0000ff'><strong class='bbc'>CC</strong></span>C<span style='color: #0000FF'><strong class='bbc'>CCa/bCC</strong></span>C<span style='color: #0000FF'><strong class='bbc'>C</strong></span>CCC<br />
 <br />
<em class='bbc'>The regions based on the protein of the model organism. (conserved region in blue)</em><br />
 <br />
Until now, only the protein in the model organism is well-studied and annotated. So, another way is to align the amino acid sequence from the model together with those from other organisms. Where the C-terminal begins (aspartate 99), that position will be a benchmark of sort to the other query sequence. Still, I felt this isn&#8217;t very convincing.<br />
 <br />
Waitiing for some feedbacks / suggestions / pointers regarding this.<br />
 <br />
Thank you]]></description>
		<pubDate>Fri, 10 Feb 2012 09:48:01 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24370-determining-the-n-and-c-terminal-regions-in-a-protein/</guid>
	</item>
	<item>
		<title>BLASTing protein block</title>
		<link>http://www.protocol-online.org/forums/topic/24196-blasting-protein-block/</link>
		<description><![CDATA[Hi<br />
I have a question, I generated protein block, which represent a conserved domain (moti) in a family of proteins I'm working on, now I would like to BLAST this motif, How can it be done ?<br />
 <br />
thanks<br />
Ofir]]></description>
		<pubDate>Sun, 29 Jan 2012 13:32:52 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24196-blasting-protein-block/</guid>
	</item>
	<item>
		<title>how to find copy number for different alu subfamilies</title>
		<link>http://www.protocol-online.org/forums/topic/24169-how-to-find-copy-number-for-different-alu-subfamilies/</link>
		<description><![CDATA[Is there a easy way to find out the copy number of invidual alu subfamilies in some database ?? Its possible to identify the subfamilies in repbase but i find it really tricky to get a full report over the copy numbers =/<br />
 <br />
 <br />
Please help me=)]]></description>
		<pubDate>Thu, 26 Jan 2012 23:05:35 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24169-how-to-find-copy-number-for-different-alu-subfamilies/</guid>
	</item>
	<item>
		<title>Determining gene of interest from Cell culture lines</title>
		<link>http://www.protocol-online.org/forums/topic/24142-determining-gene-of-interest-from-cell-culture-lines/</link>
		<description><![CDATA[Hi there, I am a summer research student currently performing PCRs to check for the presence of opioid receptors in various cell lines.<br />
 <br />
What I've done so far is to extract mRNA from cell cultures (e.g. 4T1, Raw 264.7 and H5V), doing a reverse transcription and performing PCRs on the cDNA to check if there are any opioid receptors present.<br />
 <br />
My PCRs have been very inconclusive ( I usually  get primer dimers) so I'm trying to check and see if my primers will actually work on the cell lines.<br />
 <br />
However! BLAST is extremely confusing and I have no idea if it's possible to check my primer design against cell lines. Where can I get the genomic sequence for 4T1, H5v etc, etc.<br />
 <br />
Please, any enlightenment on the topic will be most appreciated!<br />
 <br />
~Byong]]></description>
		<pubDate>Wed, 25 Jan 2012 02:29:34 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24142-determining-gene-of-interest-from-cell-culture-lines/</guid>
	</item>
	<item>
		<title>Timeline statistics</title>
		<link>http://www.protocol-online.org/forums/topic/24033-timeline-statistics/</link>
		<description><![CDATA[Hi Everyone, I am a PhD student with some difficulties about what test to choose to verify an experiment. Hope you can give me some help!<br />
 <br />
My experiment:<br />
I am counting the number of a subset of cells in Drosophila overtime.<br />
I have a mutant and a wild type control. I dissect the flies at different time points: just eclosed, 1 day after eclosion, 3 dae, 6 dae, 9 dae, 12 dae and 15 dae. For each time-point I have a n number of around 50.<br />
 <br />
The problem:<br />
What happen with these cells is that in the wild type population the number of cells does not decrease overtime, while in the mutants it does. Moreover the wild type population has an average of 6 (in all the dissection days), while the mutant starts with 4 and goes down to 2. Maybe if they were the same amount initially I could have used an anova test with a posthoc test (like tukey) to show the difference between wild type and control for each day, but in this case an anova would tell me only that all the wild type are different from the mutants (even at eclosion!) because of the basic difference they have, or that for example mutant dae15 is different from mutant dae0,1,3. Moreover, what I would like to test against is not that the wild type is different from the mutant at a particular timepoint, but that the mutant numbers decrease faster than the control numbers.<br />
 <br />
Possible solutions:<br />
I was thinking that one possibility would be to do an anova on a normalized sample: I could divide every number (and I mean the single numbers, not the average) I get for the average of the eclosion day, and do an anova on that ratio (So I would have liked normalized the two populations to the eclosion day and it would not matter anymore that their raw number are different), but something tells me I would mess up all the statistics: I am not sure if this approach is statistically correct.<br />
 <br />
Another possibility I was thinking is to use some kind of correlation or regression analysis, but in the example I found I never found this approach on my kind of experiment, so i am not sure how to work this out. In the various handbook it always show this kind of approach to smaller experiments, like comparing blood pressure and coffee assumption, and with much less numbers.<br />
 <br />
If you have any idea how this problem could be solved, I would be grateful!]]></description>
		<pubDate>Mon, 16 Jan 2012 13:49:34 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24033-timeline-statistics/</guid>
	</item>
	<item>
		<title>Searching genome with specifed number of Ns between two known sequences</title>
		<link>http://www.protocol-online.org/forums/topic/24030-searching-genome-with-specifed-number-of-ns-between-two-known-sequences/</link>
		<description><![CDATA[Do you know if there's a way to search a genome in BLAST for sequences separated by, say, 5-30 N? E.g. AAAA (N)<sub class='bbc'>5-30</sub> TTTT?]]></description>
		<pubDate>Mon, 16 Jan 2012 09:37:05 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24030-searching-genome-with-specifed-number-of-ns-between-two-known-sequences/</guid>
	</item>
	<item>
		<title>Probability of E.coli 2 mutations that lead to a lysine residue.</title>
		<link>http://www.protocol-online.org/forums/topic/24018-probability-of-ecoli-2-mutations-that-lead-to-a-lysine-residue/</link>
		<description><![CDATA[Just doing a reality check to see if my number is right.  What would the probability of a single lysine residue arising from an alanine (has to be a two base change) with a mutation rate per base per replication of 5E-10.<br />
 <br />
Thanks Much]]></description>
		<pubDate>Sat, 14 Jan 2012 17:25:02 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/24018-probability-of-ecoli-2-mutations-that-lead-to-a-lysine-residue/</guid>
	</item>
	<item>
		<title>Need help with basic bioinformatics</title>
		<link>http://www.protocol-online.org/forums/topic/23993-need-help-with-basic-bioinformatics/</link>
		<description><![CDATA[Hey,<br />
 <br />
Just a little background on myself, I'm a new grad student in a microbiology lab. I have begun to realize that bioinformatics is really important and I admit I'm atrocious in this area. I want a sort of starting point to better my knowledge. Mainly I want to be able to look up DNA and protein sequences, and also make comparisons between two sequences (DNA or protein) which I have looked up. Sadly I don't even know how to do this properly. If anyone can give me some step by step info on how to do this, it would be great.<br />
 <br />
Thanks]]></description>
		<pubDate>Thu, 12 Jan 2012 02:11:42 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23993-need-help-with-basic-bioinformatics/</guid>
	</item>
	<item>
		<title>curve fitting - growth modelling</title>
		<link>http://www.protocol-online.org/forums/topic/23986-curve-fitting-growth-modelling/</link>
		<description><![CDATA[hi guys,<br />
 <br />
I'm just wondering, has any of you done any statistical modeling of plant growth? I know, plants normally have a s-shape growth pattern, but under certain conditions (e.g. disease, drought) they don't follow this kind of growth curves.<br />
 <br />
My data shows more of a parabolic curve, does any of you have idea how to fit a parabolic equation to describe that curve?  Thanks in advance.<br />
 <br />
tj.]]></description>
		<pubDate>Wed, 11 Jan 2012 16:05:15 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23986-curve-fitting-growth-modelling/</guid>
	</item>
	<item>
		<title>Statistical Tools</title>
		<link>http://www.protocol-online.org/forums/topic/23944-statistical-tools/</link>
		<description><![CDATA[<p class='bbc_left'>A very interresting site with several statistical tools and you can do it online </p><p class='bbc_left'>(Clinical, Research, Calculators, Probabilities, Distributions, Frequency Data, Proportions, Ordinal Data, Correlation & Regression, t-Tests & Procedures, ANOVA, ANCOVA, Miscellanea)</p><br />
 <br />
<a href='http://faculty.vassar.edu/lowry/VassarStats.html' class='bbc_url' title='External link' rel='nofollow external'>http://faculty.vassa...assarStats.html</a>]]></description>
		<pubDate>Sun, 08 Jan 2012 12:01:47 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23944-statistical-tools/</guid>
	</item>
	<item>
		<title><![CDATA[Help with verifying that I'm looking at polymorphisms?]]></title>
		<link>http://www.protocol-online.org/forums/topic/23926-help-with-verifying-that-im-looking-at-polymorphisms/</link>
		<description><![CDATA[Sorry that I couldn't think of a better sounding title!<br />
 <br />
Basically I've had to do some Bioinformatic work recently having never ever done or even heard of the subject before, so I'm a complete novice as to how to use all the websites like NCBI, Vega Sanger, BLAST, BLAT... etc.! and I'm finding it really hard to understand exactly what it is that I'm looking at.<br />
 <br />
I've been trying to find the number of polymorphisms in a particular gene, human TRIM5. I found one website with it on <a href='http://www.uniprot.org/uniprot/Q9C035' class='bbc_url' title='External link' rel='nofollow external'>http://www.uniprot.org/uniprot/Q9C035</a> and simply counted the 'Natural Variants' to come up with a number of 10. However, I would be really grateful if anybody could give me some guidance. Are these natural variants actually the same as polymorphisms? Have I actually managed to include all of the polymorphisms (if they are indeed 'natural variants') or am I accidentally only looking at a subset of polymorphisms?<br />
 <br />
Just to re-iterate the fact that I really feel out in deep water with this subject and so if you do give any answers please feel free to phrase any reply as if I'm a little bit stupid! I really want to understand what all these things are, feeling lost is a very horrible feeling!<br />
 <br />
Any help would be MUCH appreciated.<br />
 <br />
Thanks in advance!]]></description>
		<pubDate>Fri, 06 Jan 2012 15:04:18 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23926-help-with-verifying-that-im-looking-at-polymorphisms/</guid>
	</item>
	<item>
		<title>TCGA portal</title>
		<link>http://www.protocol-online.org/forums/topic/23861-tcga-portal/</link>
		<description><![CDATA[Can anyone tell me how to read the results of TCGA portal?<br />
what does AgilentG4502A_07 z-score <span style='color: orangered'>&#8595; of 85% mean?</span>]]></description>
		<pubDate>Thu, 29 Dec 2011 05:39:45 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23861-tcga-portal/</guid>
	</item>
	<item>
		<title>BLAST of ITS4 ITS5 sequence</title>
		<link>http://www.protocol-online.org/forums/topic/23848-blast-of-its4-its5-sequence/</link>
		<description>Hi, I have a question about BLASTing. Sequencing was done for a fungal strain. The PCR primers were ITS4 (reverse) and ITS5 (forward). I used the ITS4 sequence to BLAST and it gave a different results than the ITS5 BLAST; both results are quite close - results showed strain from the same genus; but the top result was different. Did I use BLAST wrongly? Or to rephrase the question, since I have one reverse sequence and one forward sequence, which should I use to BLAST or will need to do anything to the sequence before BLASTing? Thanks in advance!</description>
		<pubDate>Mon, 26 Dec 2011 14:54:20 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23848-blast-of-its4-its5-sequence/</guid>
	</item>
	<item>
		<title>Comparison of gene structure</title>
		<link>http://www.protocol-online.org/forums/topic/23816-comparison-of-gene-structure/</link>
		<description><![CDATA[Hello everyone,<br />
 <br />
I have two genomic sequences, and I would like to know if there is any relationship between them regarding the protein coding genes (same proteins?, same order?). They definitely contain mutations on the nucleotide level, so a simple nucleotide BLAST with two sequences as queries is a bit pointless. They are already annotated, so I could theoretically check the corresponding Genbank-Files and look up each and every entry there and do protein BLASTs with the sequences, but that would "a bit" tedious. Moreover, I don't trust automated annotations. My question therefore is: does anybody know a good program that translates two sequences in all reading frames and compares the results?<br />
 <br />
Thanks for suggestions in advance!<br />
 <br />
Greetings<br />
Freemason]]></description>
		<pubDate>Tue, 20 Dec 2011 17:46:54 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23816-comparison-of-gene-structure/</guid>
	</item>
	<item>
		<title>Guessing candidate miRNAs from mRNA microarray</title>
		<link>http://www.protocol-online.org/forums/topic/23803-guessing-candidate-mirnas-from-mrna-microarray/</link>
		<description><![CDATA[Hi every one,<br />
<br />
There are several articles that have worked on the effect of a certain compound (of interest in my project) on mice. They have results from mRNA microarrays and have identified differentially expressed genes and pathways in the response of mice to the chemical.<br />
Is there a way that I use this information to "guess" the putative miRNA alterations in human response to the aforementioned compound?<br />
 <br />
Thank you in Advance]]></description>
		<pubDate>Sun, 18 Dec 2011 16:56:39 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23803-guessing-candidate-mirnas-from-mrna-microarray/</guid>
	</item>
	<item>
		<title>repeat masking of gene enriched genomic reads on 454 platform</title>
		<link>http://www.protocol-online.org/forums/topic/23794-repeat-masking-of-gene-enriched-genomic-reads-on-454-platform/</link>
		<description><![CDATA[<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>hi,</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>i have sequenced gene enriched genomic reads on 454 platform. After removing reads of organeller genome, i want to assemble the filtered reads on gsAssembler. Being genomic reads, should i repeat mask it prior to assembly? Whether gsAssembler can assemble repeat masked reads?</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>KM (India)</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'></span></span>[/left]]]></description>
		<pubDate>Fri, 16 Dec 2011 15:58:33 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23794-repeat-masking-of-gene-enriched-genomic-reads-on-454-platform/</guid>
	</item>
	<item>
		<title>repeat masking of gene enriched genomic reads on 454 platform</title>
		<link>http://www.protocol-online.org/forums/topic/23793-repeat-masking-of-gene-enriched-genomic-reads-on-454-platform/</link>
		<description><![CDATA[<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>hi,</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>i have sequenced gene enriched genomic reads on 454 platform. After removing reads of organeller genome, i want to assemble the filtered reads on gsAssembler. Being genomic reads, should i repeat mask it prior to assembly? Whether gsAssembler can assemble repeat masked reads?</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'><p class='bbc_left'>KM (India)</p></span></span>[/left]<span style='color: #555555'><span style='font-family: Arial, Helvetica, sans-serif'></span></span>[/left]]]></description>
		<pubDate>Fri, 16 Dec 2011 15:57:53 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23793-repeat-masking-of-gene-enriched-genomic-reads-on-454-platform/</guid>
	</item>
	<item>
		<title>Sample Size determination</title>
		<link>http://www.protocol-online.org/forums/topic/23678-sample-size-determination/</link>
		<description><![CDATA[I am doing dose response analysis (ie probit or logistic) with data that is highly variable.  One of my colleuges suggested using Jackknife or Bootstrapping to determine the best sample size.  However, I haven't been able to find an example of this in the literature.  Does anyone have any suggestions?]]></description>
		<pubDate>Tue, 06 Dec 2011 19:35:08 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23678-sample-size-determination/</guid>
	</item>
	<item>
		<title>BLAST of shRNA sequence</title>
		<link>http://www.protocol-online.org/forums/topic/23677-blast-of-shrna-sequence/</link>
		<description><![CDATA[Dear all,<br />
 <br />
I want to blast my shrna sequence to see if it is specific. I used megablast (NCBI) and nucleotide collection as database. I got many many things other than my gene of interest.<br />
																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																																											<br />
Could you please tell me, how I can determine if my shrna is specific enough to my gene of interest? Is "total score" a good representative for specificity? Should I use nucleotide collection as database or refseq-rna (this one gives less hits)?<br />
 <br />
Thanks]]></description>
		<pubDate>Tue, 06 Dec 2011 18:32:48 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23677-blast-of-shrna-sequence/</guid>
	</item>
	<item>
		<title>PAML tree design</title>
		<link>http://www.protocol-online.org/forums/topic/23576-paml-tree-design/</link>
		<description><![CDATA[I'm new to PAML and phylogenetic analysis in general.  I'm running a bunch of alignments through PAML to calculate dN/dS per branch and I need to give it a species tree but I'm not familiar with the parenthetical notation it requires.  Can anyone help me out? <br />
 <br />
The species I'm using are:<br />
Human, Chimp, Orangutan, Gorilla, Mouse, Rhesus Macaque, & Cow<br />
 <br />
Thanks!]]></description>
		<pubDate>Sun, 27 Nov 2011 20:20:20 +0000</pubDate>
		<guid>http://www.protocol-online.org/forums/topic/23576-paml-tree-design/</guid>
	</item>
</channel>
</rss>
