Abstract
Recent advances in sequencing technology have considerably increased the throughput and decreased the cost of short-read sequencing by using high-density patterned flow cells. However, high rates of cross-contamination between multiplexed libraries have been observed on data from these machines, likely due to index-switching during exclusion amplification1,2. Here, we demonstrate that a computational correction procedure based on the Sylvester equation removed 80-90% of the false positive expression signal and eliminated spurious clustering. The computational correction procedure can therefore be used to rescue aspects of affected sequence data so that researchers can take advantage of cost-effective sequencing on patterned flow cells.
Copyright
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.