HSRA is a MapReduce-based parallel tool for mapping reads from RNA sequencing (RNA-seq) experiments. RNA-seq analyses typically begin by mapping reads to a reference genome in order to determine the location from which the reads were originated, which is a very time-consuming step. This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing.

HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow HSRA

HSRA Web Site

You Might Also Like
Enterprise-Grade Monitoring - Zero Compromises Icon
Enterprise-Grade Monitoring - Zero Compromises

PRTG delivers deep visibility and proactive alerts for complex IT. Monitor, analyze, and optimize - all in one platform.

Managing a large, distributed IT environment demands more than basic monitoring. PRTG provides a unified view of your entire infrastructure - across sites, clouds, and hybrid setups. Advanced analytics, customizable dashboards, and granular access controls empower your team to detect issues early and respond fast. Automate reporting, ensure compliance, and scale effortlessly as your network grows. With PRTG, you get reliability, flexibility, and the insights you need to keep your business running at peak performance.
Start Your Free PRTG Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of HSRA!

Additional Project Details

Operating Systems

Linux

Intended Audience

Healthcare Industry, Information Technology, Science/Research

User Interface

Command-line, Console/Terminal

Programming Language

Java

Related Categories

Java Bio-Informatics Software, Java Big Data Tool

Registered

2018-02-06