<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BioDiscovery</title>
	<atom:link href="http://www.blog.biodiscovery.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.blog.biodiscovery.com</link>
	<description>News &#38; Insights About CGH, SNP, NGS - CNV Analysis Solutions</description>
	<lastBuildDate>Fri, 24 May 2013 13:28:32 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Diploid Recentering: When, Why and How</title>
		<link>http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/</link>
		<comments>http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 04:36:43 +0000</pubDate>
		<dc:creator>Andrea OHara</dc:creator>
				<category><![CDATA[Data Analysis Insights]]></category>
		<category><![CDATA[Nexus Tips & Tricks]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=513</guid>
		<description><![CDATA[<p>It is well established that cancer frequently undergoes copy number changes, which is why copy number is frequently evaluated in cancer samples for diagnostic and research purposes.  However, when beginning copy number analysis of a sample using a SNP array, it is important to run a quick sanity check to validate that the baseline calling [...]]]></description>
				<content:encoded><![CDATA[<p>It is well established that cancer frequently undergoes copy number changes, which is why copy number is frequently evaluated in cancer samples for diagnostic and research purposes.  However, when beginning copy number analysis of a sample using a SNP array, it is important to run a quick sanity check to validate that the baseline calling is in fact calling diploid regions.  In simple terms: Are the “normal” areas normal?</p>
<p>SNP arrays give information for both copy number and b-allele frequency.  With most copy number analysis, median recentering is selected by default.  This is appropriate when a sample is close to diploid.  However, when the overall ploidy strays due to extensive gains or losses, as is typical in many cancer samples, median recentering can lead to false calling.</p>
<p>Therefore, when evaluating an individual cancer sample, there are a few red flags to watch out for:</p>
<p>- Are there extensive copy neutral (baseline) areas with allelic imbalance (purple)?</p>
<p>- <span style="line-height: 1.6em">Are there extensive areas of copy number loss (red) without any corresponding allele change (purple allelic imbalance or yellow loss of heterozygosity)?</span></p>
<p>- <span style="line-height: 1.6em"> Are there extensive areas of gain (blue) without any corresponding allelic imbalance (purple)?</span></p>
<p>Here is one such example:</p>
<p><a href="http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/example-median/" rel="attachment wp-att-514"><img class="aligncenter size-large wp-image-514" alt="" src="http://www.blog.biodiscovery.com/wp-content/uploads/Example-Median-1024x469.jpg" width="705" height="322" /></a></p>
<p>As you can see in this example, there are extensive “gains” of chromosomes 2, 5, 7, 9, 10, 12, 19, 20,and 22, with no corresponding allelic imbalance, while the copy number loss regions all show appropriate loss of heterozygosity.  If we look at the whole genome view, we can get further information:</p>
<p><a href="http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/example-whole-genome-median/" rel="attachment wp-att-515"><img class="aligncenter size-large wp-image-515" alt="" src="http://www.blog.biodiscovery.com/wp-content/uploads/Example-Whole-Genome-Median-1024x464.jpg" width="705" height="319" /></a></p>
<p>The top panel shows the copy number calling (note this is an Affymetrix MIP array, so everything is shown in a linear copy number scale).  We can see many areas called as loss (~-0.4) and gain (~0.4), but almost nothing is called at the baseline state (0).  When we look at the bottom B-allele frequency panel, we can see many regions of balanced heterozygosity, as noted by the typical 3 band pattern (spots at 0, 0.5 and 1.0).  These areas include chromosomes 2, 5, 7, 9, 10, 12, 19, 20,and 22; all of the areas that are also marked as “gained”.  Chromosome 14 shows a typical 4 band allelic imbalance pattern, but is marked as high amplitude gain (~1.2).</p>
<p>We can recenter this sample to the balanced regions of heterozygosity identified above, thus defining these chromosomes as diploid.  Within <a href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Copy Number software</a>, this is done by:</p>
<p>1. <span style="line-height: 1.6em">Adding the factor “Diploid Regions” on the data set tab</span></p>
<p>2. <span style="line-height: 1.6em">Naming the diploid regions in the sample (chr2,chr5,chr7, chr9, chr10, chr12, chr19, chr20, chr22)</span></p>
<p>3. <span style="line-height: 1.6em">Adjust the settings to Recenter: Diploid Regions</span></p>
<p>4. <span style="line-height: 1.6em">Highlight the selected sample and Reset</span></p>
<p>5. <span style="line-height: 1.6em">Re-process the sample with the new settings by selecting View</span></p>
<p>Once recentered, this essentially shifts the top copy number panel while maintaining the identical lower B-allele frequency panel, as shown in the full genome view below:</p>
<p><a href="http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/example-whole-genome-recenter/" rel="attachment wp-att-516"><img class="aligncenter size-large wp-image-516" alt="" src="http://www.blog.biodiscovery.com/wp-content/uploads/Example-Whole-Genome-Recenter-1024x464.jpg" width="705" height="319" /></a></p>
<p>Now this sample looks as expected: we see areas of copy number loss in the top panel are associated with loss of heterozygosity in the bottom panel for chromosomes 1p, 3, 4, 6, 8, 11, 13, 15, 16, 17, and 21.  We see copy number gain on chromosome 14 is associated with allelic imbalance on the bottom panel.  Furthermore a baseline diploid state is associated with a normal 3 band heterozygous pattern for chromosomes 2, 5, 7, 9, 10, 12, 19, 20, and 22.  This is reiterated in the summary view below:</p>
<p><a href="http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/example-recenter/" rel="attachment wp-att-517"><img class="aligncenter size-large wp-image-517" alt="" src="http://www.blog.biodiscovery.com/wp-content/uploads/Example-Recenter-1024x469.jpg" width="705" height="322" /></a></p>
<p>As you can see, if we had not recentered this sample, we would have had extensive false positive calling.  Again, default median recentering is appropriate in samples that are close to diploid, and it is always a good place to start.  For a sample like the one above, where the actual ploidy is closer to 1.5, further recentering to specified regions may be required.  However, paying attention to the red flags outlined above will help you to recenter the sample and should yield positive results for your downstream analysis.</p>
<p>Drilling down to the whole genome view can also be useful for <a href="http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/">estimating the percentage of aberrant cells in a sample</a>, or <a href="http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/">identifying potential areas of chromothripsis</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/03/21/diploid-recentering-when-why-and-how/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Identify Chromothripsis in Tumor Samples</title>
		<link>http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/</link>
		<comments>http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/#comments</comments>
		<pubDate>Wed, 13 Feb 2013 22:26:03 +0000</pubDate>
		<dc:creator>Andrea OHara</dc:creator>
				<category><![CDATA[Data Analysis Insights]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=499</guid>
		<description><![CDATA[<p>Chromothripsis, or chromosomal shattering, occurs at a frequency of 1%-5% in most cancer types, though has been described as being upwards of 25% in bone cancer (Cell. 2011 Jan 7;144(1):9-10). It has been associated with poor prognosis (Cancer Res. 2012 Dec 27).  Chromothripsis has even been found to occur in the germline of some developmental [...]]]></description>
				<content:encoded><![CDATA[<p>Chromothripsis, or chromosomal shattering, occurs at a frequency of 1%-5% in most cancer types, though has been described as being upwards of 25% in bone cancer (Cell. 2011 Jan 7;144(1):9-10). It has been associated with poor prognosis (Cancer Res. 2012 Dec 27).  Chromothripsis has even been found to occur in the germline of some developmental disorders (Hum Mol Genet. 2011 May 15;20(10):1916-24, Cell Rep. 2012 Jun 28;1(6):648-55.).  This complex chromosomal rearrangement is thought to occur when the chromosome breaks apart into tens or even hundreds of pieces and is then rejoined through nonhomologous end-joining.  While a primary indicator of chromothripsis is a highly rearranged sequence, several hallmark features of chromothripsis can be observed in <a href="http://www.biodiscovery.com/software/nexus-copy-number/">BioDiscovery Nexus Copy Number software</a>, including alternating copy number states, high level of breakpoints, and  loss of heterozygosity in the lower state.</p>
<p><a href="http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/chr6-3/" rel="attachment wp-att-504"><img class="aligncenter size-large wp-image-504" src="http://www.blog.biodiscovery.com/wp-content/uploads/Chr61-1024x361.jpg" alt="" width="705" height="248" /></a></p>
<p>In the example shown above, we can clearly see several regions of copy number loss (red regions in middle logR panel), which are associated with appropriate loss of heterozygosity (yellow regions in the bottom B-allele frequency panel).  These regions alternate with copy number neutral regions (uncolored baseline in logR panel, uncolored 3 band pattern in bottom B-allele frequency panel) and regions of copy gain (blue regions in logR panel, purple 4 band allelic imbalance pattern in B-allele frequency panel).</p>
<p><a href="http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/genome-2/" rel="attachment wp-att-505"><img class="aligncenter size-large wp-image-505" src="http://www.blog.biodiscovery.com/wp-content/uploads/Genome-1024x461.jpg" alt="" width="705" height="317" /></a></p>
<p>Even from a whole genome overview, we can easily identify the alternating copy states and altered heterozygosity of chromosome 6.  This is just one of the many ways <a href="http://www.biodiscovery.com/software/nexus-copy-number/">BioDiscovery Nexus Copy Number software</a> can be used to identify some of the frequent aberrations found in cancer.</p>
<p>You may also like to read the article <a href="http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/">here</a> on how to estimate the percentage of aberrant cells in a tumor/normal cell mixture or the article <a href="http://www.blog.biodiscovery.com/2013/02/01/paired-analysis/">here </a>on how to paired analysis to identify somatic changes in tumor/normal pairs.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/02/13/identifying-chromothripsis-in-tumor-samples-using-nexus-copy-number-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Evaluating sequencing methods against microarrays for copy number analysis</title>
		<link>http://www.blog.biodiscovery.com/2013/02/12/how-to-evaluating-sequencing-methods-against-microarrays-for-copy-number-analysis/</link>
		<comments>http://www.blog.biodiscovery.com/2013/02/12/how-to-evaluating-sequencing-methods-against-microarrays-for-copy-number-analysis/#comments</comments>
		<pubDate>Tue, 12 Feb 2013 16:16:56 +0000</pubDate>
		<dc:creator>Louis Culot</dc:creator>
				<category><![CDATA[Industry Insight]]></category>
		<category><![CDATA[aCGH]]></category>
		<category><![CDATA[Cancer]]></category>
		<category><![CDATA[copy number]]></category>
		<category><![CDATA[Exome-CNV]]></category>
		<category><![CDATA[LOH]]></category>
		<category><![CDATA[NGS]]></category>
		<category><![CDATA[sequencing]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=478</guid>
		<description><![CDATA[<p>Sequencing is increasing in popularity as a platform for copy number analysis. But how does it stack up to traditional methods (typically microarrays) ? To answer this, or at least begin to, the question has to be reframed around “which” array technology is being compared to “which” sequencing methods, what the scientific goals are (broadly, [...]]]></description>
				<content:encoded><![CDATA[<p>Sequencing is increasing in popularity as a platform for copy number analysis. But how does it stack up to traditional methods (typically microarrays) ? To answer this, or at least begin to, the question has to be reframed around “which” array technology is being compared to “which” sequencing methods, what the scientific goals are (broadly, and as relates to copy number analysis), and, equally important, the nature of the samples.</p>
<p>Looking at a recent paper (<a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919738/">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919738/</a>), cost-effective sequencing techniques can be used to detect large copy number changes. The low-depth techniques have low resolution (e.g., in the paper 0.04x coverage resolved CNVs of 15kb), but are cost-competitive. At increased, but still low depth (e.g., 0.3x), the resolution and cost compares favorably to some aCGH platforms, but the technique has some limitations. Obviously, focal events smaller than the window will not be resolved. Perhaps less apparent, the technique cannot be used to determine the zygosity of the input DNA. For both constitutional and cancer cases, this limitation makes the technique poorly suited to finding copy-neutral LOH (loss of heterozygosity) regions. However, unlike aCGH, the technique can be used to find the ploidy of the sample by summing the number of reads per chromosome and dividing by the chromosome length, then relative read depths can be used to infer copy number at points along the genome.</p>
<p>Simple CGH arrays also have the problem of determining zygosity, but SNP array technology (such as the Affy CytoScan HD and OncoScan products, Illumina OMNI arrays, and Agilent CGH+SNP) solves it. By using SNP arrays, zygosity of the sample within various regions of the genome can be imputed through the allelic balance (commonly reported as the B-allele frequency). Interpretation of SNP array data is further complicated, especially in cancer samples, due to sample heterogeneity; but in most cases the underlying aberrations can be detected with a high degree of confidence, along with copy-neutral LOH.</p>
<p>The limitations of low-depth sequencing are also circumvented, easily enough, at a sufficient depth of coverage that allows accurate base calling. Here, not only can zygosity be deduced, but small structural variants as well as somatic mutations can be detected, creating a more robust picture than arrays alone. Though, problems still exist. At this level, the cost and analysis of sequencing data may become problematic, particularly if the primary goal of genome-wide sequencing is for copy number detection (as opposed to a by-product result from some other objective). For instance, techniques to detect LOH from sequencing (including exome sequencing) have been developed, but require a depth of coverage which significantly increases the cost. The Exome-CNV paper tested the technique to find copy number and LOH using about a 40x depth of coverage on melanoma samples (<a href="http://bioinformatics.oxfordjournals.org/content/early/2011/08/09/bioinformatics.btr462.full.pdf">http://bioinformatics.oxfordjournals.org/content/early/2011/08/09/bioinformatics.btr462.full.pdf</a>).</p>
<p>Archival FFPE samples pose challenges for both arrays and NGS methods. The low-input DNA, sample heterogeneity (since archival FFPE are commonly cancer samples), and sample degradation all make analyzing FFPE samples difficult. The current wisdom is that sequencing methods ought to have an advantage here, since they don’t require DNA amplification, and the degraded DNA can be sequenced directly. But at least one array method, using molecular inverted probes (MIP), promises to work at least as well on FFPE samples, and perhaps better (<a href="http://www.sciencedirect.com/science/article/pii/S2210776212001627">http://www.sciencedirect.com/science/article/pii/S2210776212001627</a>)</p>
<p>From where we stand today, the way to arrive at a clear answer as to whether sequencing or arrays makes sense for copy number analysis is to frame the question in terms of the nature of the sample, analysis objectives, and cost constraints. You may conclude, for example, that low-depth sequencing is a cost-effective and robust alternative to a low-resolution aCGH platform. But, if you work in cancer samples, or constitutional samples requiring LOH detection, depending on your goals you may decide the best approach is a high-density SNP array combined with targeted re-sequencing. In any event, the technology is moving at a rapid pace and you can be reasonably assured that whatever is the right answer ‘today’ is likely to change.</p>
<p>Please comment and add your thoughts &#8212; there&#8217;s so much happening at the moment that this likely (a) contains errors, (b) is out of date, and/or (c) misses some key points.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/02/12/how-to-evaluating-sequencing-methods-against-microarrays-for-copy-number-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Genome annotations &#8211; now on the Downloads page!</title>
		<link>http://www.blog.biodiscovery.com/2013/02/07/genome-annotations-now-on-the-downloads-page/</link>
		<comments>http://www.blog.biodiscovery.com/2013/02/07/genome-annotations-now-on-the-downloads-page/#comments</comments>
		<pubDate>Thu, 07 Feb 2013 17:58:56 +0000</pubDate>
		<dc:creator>shaliniverma</dc:creator>
				<category><![CDATA[Data Analysis Insights]]></category>
		<category><![CDATA[Nexus Tips & Tricks]]></category>
		<category><![CDATA[adding a new organism]]></category>
		<category><![CDATA[c. elegans]]></category>
		<category><![CDATA[chimpanzee]]></category>
		<category><![CDATA[CNV]]></category>
		<category><![CDATA[copy number variation]]></category>
		<category><![CDATA[cow]]></category>
		<category><![CDATA[dog]]></category>
		<category><![CDATA[faciparum]]></category>
		<category><![CDATA[genome annotations]]></category>
		<category><![CDATA[LOH]]></category>
		<category><![CDATA[medaka]]></category>
		<category><![CDATA[monkey]]></category>
		<category><![CDATA[Nexus Copy Number]]></category>
		<category><![CDATA[Nexus Solo]]></category>
		<category><![CDATA[orangutan]]></category>
		<category><![CDATA[organisms]]></category>
		<category><![CDATA[zebrafish]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=473</guid>
		<description><![CDATA[<p>Did you know that you can use data from virtually any genome to identify copy number variations (CNV) and LOH using <a title="BioDiscovery Nexus Copy Number" href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Copy Number</a> and <a title="BioDiscovery Nexus Solo" href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Solo</a>? When you first launch the software and create a new project, you will see two organisms (human and mouse) [...]]]></description>
				<content:encoded><![CDATA[<p>Did you know that you can use data from virtually any genome to identify copy number variations (CNV) and LOH using <a title="BioDiscovery Nexus Copy Number" href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Copy Number</a> and <a title="BioDiscovery Nexus Solo" href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Solo</a>? When you first launch the software and create a new project, you will see two organisms (human and mouse) in the “Organism” dropdown. But you are not limited to these two organisms. These two are included with the installer as they are the most commonly used and we want to keep the installers as small as possible. But you can easily add a new organism if you don’t work on human or mouse and you can also add new genome annotation files in the future as your project scope changes.</p>
<p>Now we’ve made it even easier to obtain additional genomes by providing currently available annotation files as zip archive downloads on our <strong><a title="BioDiscovery Software Downloads" href="http://www.biodiscovery.com/support/downloads/#organism">Downloads page</a></strong>. If you don’t see your organism here, don’t worry, you can easily create these annotation files yourself and add to Nexus. Check out the Appendix on “Adding a New Organism” in the User Manual. If you still need help, contact <a title="BioDiscovery Product Support" href="http://www.biodiscovery.com/product-support/">BioDiscovery Technical Support</a> and we will help in creating the necessary files. As we generate new annotation files, we will add them to the Downloads page.</p>
<p>Here are some of the <strong>organisms currently available for Nexus Copy Number and Nexus Solo</strong>: cow, cat, dog, monkey, horse, platypus, chimpanzee, orangutan, rat, pig, C. elegans, cichlid, zebrafish, medaka, corn, chicken, yeast, Streptococcus, P. falciparum, and C. raciborskii.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/02/07/genome-annotations-now-on-the-downloads-page/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Estimation of Aberrant Cell Percentage in Tumor Normal Cell Mixture</title>
		<link>http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/</link>
		<comments>http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/#comments</comments>
		<pubDate>Tue, 05 Feb 2013 00:25:32 +0000</pubDate>
		<dc:creator>Zhiwei Che</dc:creator>
				<category><![CDATA[Data Analysis Insights]]></category>
		<category><![CDATA[aberrant cell percentage]]></category>
		<category><![CDATA[BAF]]></category>
		<category><![CDATA[mosaicity]]></category>
		<category><![CDATA[normal cell contamination]]></category>
		<category><![CDATA[tumor normal mixture]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=447</guid>
		<description><![CDATA[<p>Cancer samples, especially solid tumors, usually contain a mixture of tumor and normal cells.  Using both the SNP and copy number information provided by SNP arrays (Affymetrix or Illumina), the % aberrant cells can be estimated.  Here we are going to discuss a simple formula for such calculation based on one-copy loss event.</p> <p>Figure below [...]]]></description>
				<content:encoded><![CDATA[<p>Cancer samples, especially solid tumors, usually contain a mixture of tumor and normal cells.  Using both the SNP and copy number information provided by SNP arrays (Affymetrix or Illumina), the % aberrant cells can be estimated.  Here we are going to discuss a simple formula for such calculation based on one-copy loss event.</p>
<p>Figure below shows the B-Allele Frequency (BAF) bands in a region where a mixture of aberrant cells with one-copy loss and non-aberrant/normal cells is present.</p>
<p><a href="http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/test2-2/" rel="attachment wp-att-468"><img class="aligncenter size-full wp-image-468" title="test2" src="http://www.blog.biodiscovery.com/wp-content/uploads/test2.jpg" alt="" width="624" height="237" /></a></p>
<p>With no aberrant cells (0%), or only normal cells, a typical 3-band BAF pattern is displayed on the left, indicating AA (0% BAF), AB (50% BAF), and BB (100% BAF) alleles.  With only aberrant cells (100%) of one-copy loss, a typical 2-band BAF pattern is shown on the right, a representation of either A (0% BAF) or B (100% BAF) allele.  When there is a mixture of these two types of cells, two middle BAF bands will appear.  And the location of the two middle bands is dependent upon the % of aberrant cells.</p>
<p>Because the two middle bands are mirror images of each other around 50% BAF, only <strong>the lower band</strong> is needed for the calculation of the % aberrant cells.  A simple formula is deduced for such calculation:</p>
<p><a href="http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/test-2/" rel="attachment wp-att-469"><img class="aligncenter size-full wp-image-469" title="test" src="http://www.blog.biodiscovery.com/wp-content/uploads/test.jpg" alt="" width="368" height="64" /></a></p>
<p><span style="line-height: 1.6em;">And based on this formula, different % aberrant cells and their related BAF values are listed in this table:</span></p>
<p style="text-align: left;">
<p> <a href="http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/test1-2/" rel="attachment wp-att-470"><img class="aligncenter size-full wp-image-470" title="test1" src="http://www.blog.biodiscovery.com/wp-content/uploads/test1.jpg" alt="" width="557" height="74" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/02/05/estimation-of-aberrant-cell-percentage-in-tumor-normal-cell-mixture-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paired Analysis</title>
		<link>http://www.blog.biodiscovery.com/2013/02/01/paired-analysis/</link>
		<comments>http://www.blog.biodiscovery.com/2013/02/01/paired-analysis/#comments</comments>
		<pubDate>Fri, 01 Feb 2013 21:09:31 +0000</pubDate>
		<dc:creator>Alessio Venier</dc:creator>
				<category><![CDATA[Industry Insight]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=435</guid>
		<description><![CDATA[<p>Copy number variants (CNVs) are recognised to be part of the natural genetic variation in humans. Although there is still work to do to fully understand biology involved, it is likely that CNVs are responsible for a considerable proportion of phenotypic variation between individuals. On the other hand, copy number alterations (CNAs) are somatic changes [...]]]></description>
				<content:encoded><![CDATA[<p>Copy number variants (CNVs) are recognised to be part of the natural genetic variation in humans. Although there is still work to do to fully understand biology involved, it is likely that CNVs are responsible for a considerable proportion of phenotypic variation between individuals. On the other hand, copy number alterations (CNAs) are somatic changes in copy number that are characteristic of the genome of cancerous cells. These CNAs can affect cancer genes such as tumor suppressor genes and oncogenes. </p>
<p>Both somatic copy number alterations (CNAs) and germline copy number variants (CNVs) that are prevalent in healthy individuals can appear as recurrent changes in comparative genomic hybridization (CGH) analyses of tumors. This represent a challenge during the analysis since researchers are keen to identify regions that are frequently gained or lost in patients with a particular cancer.</p>
<p>What strategies can be implemented to distinguish between CNA and CNV changes?</p>
<p>One possible solution would be to collect data from healthy and tumor tissue samples from the same individual and run a matched paired analysis.</p>
<p>When performing matched paired analysis, generally a tumor sample is compared to its matched normal sample to identify only those events in the tumor that are not present in the normal tissue. Two files are loaded for each sample: data from the tumor tissue and the data from the matched normal tissue. (i.e. normal tissue from the same patient). The two files are combined into one result after the values in the normal sample are subtracted from the corresponding probes in the tumor sample. In the resulting profile, the Log ratios and B-Allele Frequencies (BAF) are shown after subtraction.<br />
<a href="http://www.blog.biodiscovery.com/2013/02/01/paired-analysis/image001/" rel="attachment wp-att-436"><img src="http://www.blog.biodiscovery.com/wp-content/uploads/image001-300x112.jpg" alt="Paired Analysis result" title="image001" width="300" height="112" class="alignnone size-medium wp-image-436" /></a><br />
In BioDiscovery Nexus Copy Number software, researchers can perform matched paired analysis by loading both the tumor and normal data for each sample. After the values from the normal sample are subtracted from the tumor sample, these corrected probe level values are used to perform segmentation and calling. To see how to load tumor/normal paired samples in Nexus, please view the <a href="http://www.biodiscovery.com/downloads/pdfs/HowToGuides/DataLoading/NexusCopyNumberQuickSheetforPairedAnalysis.pdf">How To Load and Process Data for Matched Paired Analysis Guide</a> from our <a href="http://www.agdsweb.com/test_site/how-to-guides/">How To Guides collection</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/02/01/paired-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to analyze NGS data in Nexus</title>
		<link>http://www.blog.biodiscovery.com/2013/01/15/how-to-analyze-ngs-data-in-nexus/</link>
		<comments>http://www.blog.biodiscovery.com/2013/01/15/how-to-analyze-ngs-data-in-nexus/#comments</comments>
		<pubDate>Tue, 15 Jan 2013 16:42:39 +0000</pubDate>
		<dc:creator>Raja Keshavan</dc:creator>
				<category><![CDATA[Industry Insight]]></category>
		<category><![CDATA[BAM]]></category>
		<category><![CDATA[ngCGH]]></category>
		<category><![CDATA[NGS]]></category>
		<category><![CDATA[VCF]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=422</guid>
		<description><![CDATA[<p>This post deals with loading NGS data into Nexus for copy number variation and structural variants.</p> <p>BAM files produced from NGS have to be processed before loading into Nexus. Variant calls in VCF format can be loaded into the next version of Nexus to be released shortly. VCF files can be created from BAM files [...]]]></description>
				<content:encoded><![CDATA[<p>This post deals with loading NGS data into Nexus for copy number variation and structural variants.</p>
<p>BAM files produced from NGS have to be processed before loading into Nexus. Variant calls in VCF format can be loaded into the next version of Nexus to be released shortly. VCF files can be created from BAM files with tools like <a href="http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit" target="_blank">GATK</a> Or <a href="http://varscan.sourceforge.net/" target="_blank">VarScan</a> Or samtools <a href="http://samtools.sourceforge.net/mpileup.shtml" target="_blank">mpileup</a> option.</p>
<p>For Exome data, where matching sample and reference BAM files are present, Dr Sean Davis’ Python script ngCGH can be used to create a text file (with log2ratio of read-depth counts) and loaded into Nexus with NGS custom data type for copy number. More details can be seen in this <a href="http://www.biodiscovery.com/2012/05/16/copy-number-estimation-from-exome-and-genome-sequencing-data/" target="_blank">webinar</a>.</p>
<p>Details on running the Python script are available at:</p>
<p><a href="http://github.com/seandavi/ngs" target="_blank">http://github.com/seandavi/ngs<br />
</a><a href="https://github.com/seandavi/ngs/blob/master/scripts/ngCGH.py" target="_blank">https://github.com/seandavi/ngs/blob/master/scripts/ngCGH.py</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/01/15/how-to-analyze-ngs-data-in-nexus/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PAG XXI starts in a few days!</title>
		<link>http://www.blog.biodiscovery.com/2013/01/09/pag-xxi-starts-in-a-few-days/</link>
		<comments>http://www.blog.biodiscovery.com/2013/01/09/pag-xxi-starts-in-a-few-days/#comments</comments>
		<pubDate>Wed, 09 Jan 2013 22:19:45 +0000</pubDate>
		<dc:creator>shaliniverma</dc:creator>
				<category><![CDATA[Events and Announcements]]></category>
		<category><![CDATA[agricultural]]></category>
		<category><![CDATA[agrigenomics]]></category>
		<category><![CDATA[animal]]></category>
		<category><![CDATA[CNV]]></category>
		<category><![CDATA[copy number analysis]]></category>
		<category><![CDATA[copy number variation]]></category>
		<category><![CDATA[gene expression]]></category>
		<category><![CDATA[genomics]]></category>
		<category><![CDATA[ImaGene]]></category>
		<category><![CDATA[miRNA]]></category>
		<category><![CDATA[mRNA]]></category>
		<category><![CDATA[Nexus Copy Number]]></category>
		<category><![CDATA[Nexus Expression]]></category>
		<category><![CDATA[NGS]]></category>
		<category><![CDATA[PAG]]></category>
		<category><![CDATA[plant]]></category>
		<category><![CDATA[Plant and Animal Genome Meeting]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=414</guid>
		<description><![CDATA[<p>The annual <a href="http://www.intlpag.org/2013/">Plant and Animal Genome Sciences (PAG XXI) meeting</a> is starting this weekend in San Diego, CA. This is the largest genomics conference focusing on non-human species and attendance is expected at over 2800. BioDiscovery will be exhibiting at <a href="http://www.mapyourshow.com/shows/index.cfm?Show_ID=pag13&#38;exhid=25215&#38;booth=407&#38;hall=C">booth #407</a> so stop by and talk to our application specialists on how [...]]]></description>
				<content:encoded><![CDATA[<p>The annual <a href="http://www.intlpag.org/2013/">Plant and Animal Genome Sciences (PAG XXI) meeting</a> is starting this weekend in San Diego, CA. This is the largest genomics conference focusing on non-human species and attendance is expected at over 2800. BioDiscovery will be exhibiting at <a href="http://www.mapyourshow.com/shows/index.cfm?Show_ID=pag13&amp;exhid=25215&amp;booth=407&amp;hall=C">booth #407</a> so stop by and talk to our application specialists on how our products can help you with your agricultural genomics projects. We have products for copy number variation analysis from microarray and NGS technologies and for mRNA/miRNA expression analysis. Our products support research on any organism for which genome data is available. We have customers working on organisms ranging from E. coli to chimpanzees and humans including dog, pig, maize, and more. To keep our software installer size to a minimum, only human and mouse genomes are packaged in the installer. Those researchers requiring any other organism should contact our <a href="http://www.biodiscovery.com/product-support/">Support team</a> and our Support engineers will provide the organism files. If we don’t already have the files, we will create them for you. Soon we will be adding these organism files to our website for direct and easy download. In the meantime, just contact our <a href="http://www.biodiscovery.com/product-support/">Support team</a> and they will gladly send you the appropriate files. Get a preview of our product offerings now and then come to our booth to talk with our application engineers and get a live demo.</p>
<p><a href="http://www.biodiscovery.com/software/nexus-copy-number/">Nexus Copy Number</a> (copy number and allelic events from microarray and NGS)</p>
<p><a href="http://www.biodiscovery.com/software/nexus-expression/">Nexus Expression</a> (miRNA and mRNA expression analysis)</p>
<p><a href="http://www.biodiscovery.com/software/imagene/">ImaGene</a> (image analysis)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2013/01/09/pag-xxi-starts-in-a-few-days/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>From Raw Data to Copy Number Calls &#8211; Data processing workflow in Nexus Copy Number</title>
		<link>http://www.blog.biodiscovery.com/2012/12/27/from-raw-data-to-copy-number-calls-data-processing-workflow-in-nexus-copy-number/</link>
		<comments>http://www.blog.biodiscovery.com/2012/12/27/from-raw-data-to-copy-number-calls-data-processing-workflow-in-nexus-copy-number/#comments</comments>
		<pubDate>Thu, 27 Dec 2012 23:57:40 +0000</pubDate>
		<dc:creator>Zhiwei Che</dc:creator>
				<category><![CDATA[Data Analysis Insights]]></category>
		<category><![CDATA[Calling Thresholds]]></category>
		<category><![CDATA[Copy Number Calls]]></category>
		<category><![CDATA[FASST2]]></category>
		<category><![CDATA[Raw Data]]></category>
		<category><![CDATA[segmentation algorithm]]></category>
		<category><![CDATA[Significance Threshold]]></category>
		<category><![CDATA[SNP-FASST2]]></category>
		<category><![CDATA[systematic correction]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=392</guid>
		<description><![CDATA[<p style="text-align: left;" align="center">The so-called “Raw Data” for the data analysis in Nexus Copy Number can be intensity values for array probes (Affymetrix CEL files), logRatios after normalization (Agilent Feature Extraction results files and Illumina final report files), or even copy number state values (Affymetrix OncoScan data).  Let’s take a look at how Nexus Copy [...]]]></description>
				<content:encoded><![CDATA[<p style="text-align: left;" align="center">The so-called “Raw Data” for the data analysis in Nexus Copy Number can be intensity values for array probes (Affymetrix CEL files), logRatios after normalization (Agilent Feature Extraction results files and Illumina final report files), or even copy number state values (Affymetrix OncoScan data).  Let’s take a look at how Nexus Copy Number handles the different “Raw Data” and what is common in the overall data processing workflow.</p>
<p>Affymetrix CEL files (intensity values) need to be normalized to generate logRatios before proceeding to copy number analysis.  A normal reference file generated from the HapMap pooled samples is used for this purpose.  Depending on the Affymetrix array types, different reference files are generated for 500K, SNP 6.0, and CytoScan HD.  Each CEL file from the test samples is used against the corresponding reference file to get logRatios for all the probes on the array.  A systematic correction process follows to straighten up the systematic waviness pattern of the data distribution, which results mainly from the G-C contents of the probes and the samples, as well as from other factors.  The data is finally analyzed to get the copy number calls using the default SNP-FASST2 segmentation algorithm.</p>
<p>LogRatio data, such as Agilent Feature Extraction results files or Illumina final report files, are directly processed by FASST2 (for aCGH arrays) or SNP-FASST2 (for SNP arrays) to get copy number calls, usually preceded by the systematic correction step for possible data waviness.</p>
<p>A special case is Affymetrix OncoScan data files, Copy_Number.txt and Assays.txt.  The “Raw Data” are probe copy number state values, which are linear values rather than logRatios.  However, Nexus Copy Number processes the data similarly to the logRatios, i.e. systematic correction followed by copy number segmentation with SNP-FASST2.</p>
<p>As described above, the last step for the data analysis workflow is the copy number segmentation, which results in the copy number calls for the copy number segments.  What this step ultimately does is to decide whether a group of probes should be in a distinct segment or stay with the current segment of neighboring probes.  The default copy number segmentation algorithm in Nexus Copy Number is FASST2 (for aCGH arrays) or SNP-FASST2 (for SNP arrays), which is based on the popular Hidden Markov Model (HMM) algorithm.  However, unlike the conventional HMM, no integer copy number states (e.g. 0, 1, 2, 3, and 4) are used.  Instead, the states are defined by the copy number calling thresholds, which are based on logRatios (linear values in Affymetrix OncoScan Data) and can be easily adjusted by the end user.  The number of segments is determined by the Significance Threshold setting, which is equivalent to a P-value cut-off and is also adjustable to the user.  The probability that a group of probes belong to a certain copy number state is compared to this Significance Threshold and a new segment is generated if the Significant Threshold is surpassed.  The value of the new segment, median value from the logRatios (linear values in Affymetrix OncoScan Data) of the group of probes in the segment, is then compared to the calling thresholds (one copy gain, one copy loss, gain with two or more copies, and homozygous loss) to get the corresponding copy number calls.</p>
<p>In summary, there are three major steps in the data analysis workflow from “Raw Data” to copy number calls:</p>
<ol>
<li>Probe intensities to logRatios</li>
<li>Systematic correction for data waviness</li>
<li>Copy number segmentation for copy number calls</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2012/12/27/from-raw-data-to-copy-number-calls-data-processing-workflow-in-nexus-copy-number/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Need software but don’t have the funds?</title>
		<link>http://www.blog.biodiscovery.com/2012/12/19/need-software-but-dont-have-the-funds/</link>
		<comments>http://www.blog.biodiscovery.com/2012/12/19/need-software-but-dont-have-the-funds/#comments</comments>
		<pubDate>Wed, 19 Dec 2012 16:18:23 +0000</pubDate>
		<dc:creator>shaliniverma</dc:creator>
				<category><![CDATA[Events and Announcements]]></category>
		<category><![CDATA[BioDiscovery]]></category>
		<category><![CDATA[childhood diseases]]></category>
		<category><![CDATA[childhood illness]]></category>
		<category><![CDATA[genomic]]></category>
		<category><![CDATA[pediatric maladies]]></category>
		<category><![CDATA[pediatric research]]></category>
		<category><![CDATA[software donation program]]></category>

		<guid isPermaLink="false">http://www.blog.biodiscovery.com/?p=376</guid>
		<description><![CDATA[<p><a href="http://www.blog.biodiscovery.com/2012/12/19/need-software-but-dont-have-the-funds/treewithsnowflakes/" rel="attachment wp-att-377"></a>&#8216;Tis the season of giving and BioDiscovery shares that sentiment! We have recently launched a <a title="BioDiscovery Software Donation Program" href="http://www.biodiscovery.com/corporate/software-donation-program/">software donation program</a> to benefit pediatric research. It is heartbreaking to see young ones struggle with illness and not be able to share in the same joys of childhood as their peers. [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.blog.biodiscovery.com/2012/12/19/need-software-but-dont-have-the-funds/treewithsnowflakes/" rel="attachment wp-att-377"><img class="alignleft  wp-image-377" title="treewithsnowflakes" src="http://www.blog.biodiscovery.com/wp-content/uploads/treewithsnowflakes-300x225.jpg" alt="" width="239" height="179" /></a>&#8216;Tis the season of giving and BioDiscovery shares that sentiment! We have recently launched a <a title="BioDiscovery Software Donation Program" href="http://www.biodiscovery.com/corporate/software-donation-program/">software donation program</a> to benefit pediatric research. It is heartbreaking to see young ones struggle with illness and not be able to share in the same joys of childhood as their peers. The scientific community has made a lot of progress in finding new treatments as well as creating methods for earlier diagnosis. Often though, funding becomes a barrier and great ideas or projects aren’t able to germinate. Dr. Soheil Shams, President of BioDiscovery, said it well when he stated “<strong>Limitations on funding should not be an impediment in finding better treatments, especially for pediatric maladies</strong>.” I could not agree more and am thrilled that BioDiscovery is able to donate software licenses to those who lack funding. Hopefully this will assist researchers make breakthrough discoveries to accelerate diagnosis and treatment of childhood diseases. BioDiscovery has committed $100,000 worth of software licenses yearly to this program. More information on the program can be found on the <a title="BioDiscovery Software Donation Program" href="http://www.biodiscovery.com/corporate/software-donation-program/">application form</a> and the <a title="BioDiscovery Software Donation Program press release" href="http://www.biodiscovery.com/2012/12/17/biodiscovery-initiates-software-donation-program-to-benefit-pediatric-research/">press release</a>.</p>
<p>And on behalf of everyone at BioDiscovery, <strong><span style="color: #ff0000;">H</span><span style="color: #339966;">a<span style="color: #ff0000;">p</span>p</span><span style="color: #ff0000;">y</span> <span style="color: #ff0000;">H</span><span style="color: #339966;">o<span style="color: #ff0000;">l</span>i<span style="color: #ff0000;">d</span>a<span style="color: #ff0000;">y</span></span></strong><span style="color: #339966;">s</span><span style="color: #ff0000;">! </span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.blog.biodiscovery.com/2012/12/19/need-software-but-dont-have-the-funds/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
