Editorial Novel Bioinformatics Approaches for Analysis of High-Throughput Biological...

EditorialNovel Bioinformatics Approaches for Analysis ofHigh-Throughput Biological Data

Julia Tzu-Ya Weng,1,2 Li-Ching Wu,3 Wen-Chi Chang,4 Tzu-Hao Chang,5

Tatsuya Akutsu,6 and Tzong-Yi Lee1,2

1Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan2Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan 320, Taiwan3Institute of Systems Biology and Bioinformatics, National Central University, Taoyuan 320, Taiwan4Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan5Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei 110, Taiwan6Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto 611-0011, Japan

Correspondence should be addressed to Tzong-Yi Lee; [email protected]

Received 2 October 2014; Accepted 2 October 2014; Published 28 December 2014

Copyright © 2014 Julia Tzu-Ya Weng et al. This is an open access article distributed under the Creative Commons AttributionLicense, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properlycited.

1. Introduction

With the advent of high-throughput technologies, molec-ular biology is experiencing a surge in both growth andscope. As the amount of experimental data increases, thedemand for the development of ways to analyze these resultsalso increases. For example, the next-generation sequencing(NGS) technology has generated various sequencing data.Mass spectrometry- (MS-) based experiments are also widelyapplied in proteomics studies. Rapidly advancing technolo-gies have offered us the opportunities to examine the genome,transcriptome, and proteome in comprehensive ways. Yet,extracting meaningful information from this vast sea of dataand approaching biological problems from systems biologyperspective have become the Holy Grail in bioinformat-ics. The main focus of this special issue is novelty: newideas, original research findings, and practical applicationsthat intend to answer biological questions through high-throughput technologies. The papers in this special issuepresent methods and experiments that demonstrate novelplatforms and systems and new bioinformatics tools andmodels, as well as new data-analytical methods for high-throughput biological data.

In this special issue, U. Rosani et al. attempted tounravel the genome ofMytilus galloprovincialis, the Mediter-raneanmussel, through a target capture and high-throughput

massive sequencing approach to reduce whole genomesequencing cost and effort. However, inferences fromsequencing data rely heavily on careful experimental design,as well as efficient detection and removal of artifacts. Whileanalyzing restriction-based reduced representation genomicdata, D. C. Ilut et al. demonstrated that, by setting an optimalclustering threshold, false homozygosity or heterozygositycan be effectively minimized.

With the advancement of genomic researches, the num-ber of sequences processed in comparative methods hasgrown immensely. E. A. Marucci et al. developed a parallelalgorithm formultiple sequence similarities calculation usingthe k-mers counting method. Their tests showed that thealgorithm provides a very good scalability and a nearlylinear speedup. In “A de novo genome assembly algorithm forrepeats and nonrepeats,” Z. Dai et al. proposed a new genomeassembly algorithm called the sliding window assembler(SWA), which assembles repeats and nonrepeats by adoptinga new overlapping extension strategy to extend each seed andimplementing a compensationalmechanism for low coveragedatasets. Results of their analysis on three datasets support thepracticability and efficiency of SWA as a promising algorithmfor NGS data.

High-throughput technology holds great promises for theefficient investigation of transcriptomes, but the enormousamount of gene expression data demands effective analytical

Hindawi Publishing CorporationBioMed Research InternationalVolume 2014, Article ID 814092, 3 pageshttp://dx.doi.org/10.1155/2014/814092

2 BioMed Research International

tools. By combining three gene-set analytical methods in oneR statistical package, C.-Y. Chien et al. presented MAVT-gsa, offering a systematic pipeline for the identification ofsignificant gene-set modules from a set of gene expressiondata. Often, genes are coexpressed and coregulated or interacttogether to orchestrate a series of biological processes. Todecipher the complex genetic networks associatedwith differ-ent cellular functions, M. Huerta et al. proposed to study theexpression dependence between not only coexpressed genesbut also sets of coexpressed genes.

In an attempt to predict the survival time in patients withoral squamous cell carcinoma, O. Hamidi et al. demonstratedthat the three sparse variable selection techniques, whenapplied on gene expressionmicroarray data, were able to yieldbetter prediction results. For bladder cancer, Y.-H. Wong etal. proposed a statistical method based on carcinogenesisrelevance values (CRVs) to identify 152 and 50 significantproteins and subsequently generated novel protein-proteininteraction (PPI) network markers for early and late stagebladder cancer. Their findings not only provide new cluesspecific to cancer but also offer cancer researchers newdirections for targeted cancer therapy.

In metagenomics, C.-M. Chiu et al. developed a pipelinefor the systematic analysis of the association between gutflora and obesity through high-throughput sequencing andbioinformatics approaches. Eighty-one stool samples werecollected and the V4 region of 16S rRNA genes was selectedfor metagenomics analysis.The results demonstrate that bac-terial communities in the gut could be clustered into the N-like (normal) group and OB-like (obese) group. Remarkably,most of the normal samples were clustered in the N-likegroup, and theOB-like groupwas enrichedwith case samples,indicating that bacterial communities in the gut were highlyassociated with obesity. The results provide new insights intothe correlation of gut flora with the rising trend in obesity.

In order to explore the molecular mechanism of floundersex determination and development, Z. Fan et al. appliedRNA-seq technology to investigate the transcriptomes offlounder gonads, obtaining 22,253,217 and 19,777,841 quali-fied reads from the ovary and testes, respectively.These readswere jointly assembled into 97,233 contigs. Among them,2,193 contigs were identified to be differentially expressed inthe ovary and 887 in the testes. Following annotation, severalsex-related biological pathways including ovarian steroidoge-nesis and estrogen signaling pathways were revealed in theflounder for the first time.

Several bioinformatics tools are now being employed toanalyze high-throughput expression data. In an attempt tostudy the molecular changes as a result of radiation exposure,K.-F. Lee et al. designed a set of expression microarrayexperiments studying the changes in gene and microRNAexpression in peripheral mononuclear blood cells treatedwith varying doses of radiation. Combined with the existingtools for biochip analysis, K.-F. Lee et al. identified the variouspathways associated with the exposure to differing doses ofradiation and the potential gene-microRNA interactions thatregulate these pathway changes.

The rapid increase in microRNA NGS data demands thedevelopment of comprehensive and customized tools for data

analysis. In “Large-scale investigation of human TF-miRNArelations based on coexpression profiles,” C.-H. Chien et al.developed a computational strategy to investigate the tran-scription factors of human miRNA genes on a global scale.The proposed method helps enhance our understanding ofthe transcriptional regulatory mechanisms of miRNAs. Onthe other hand, in “miRSeq: a user-friendly standalone toolkitfor sequencing quality evaluation and miRNA profiling,” C.-T. Pan et al. introduced a new tool for NGS data alignmentthat not only is easy to implement but also offers variousmethods for evaluating sequencing quality and providesprofiles for up to 105 species for users to compare with.These studies demonstrate that customizability, easy access,and user-friendliness are crucial to high-throughput dataanalysis.

In the analysis of protein catalytic sites, C.-S. Yu et al.found that the side chain of catalytic residues usually pointsto the center of the catalytic site. The results demonstratethat the proposed method (EXIA2) could outperform theexisting methods on several benchmark datasets that includeover 1,200 enzyme structures. In “High-throughput functionalscreening of steroid substrates with wild-type and chimericP450 enzymes,” P. Urban et al. identified the structuralfeatures of steroid-based substrates catalyzed by CYP1Aenzymes containing wild-type and synthetic variants. Theresults are interesting and may help extend the scope ofknowledge surrounding the structural properties of enzymesrecognizing and metabolizing exo- and endogenous sub-strates including drugs. In “Bioinformatic prediction ofWSSV-host protein-protein interaction,” Z. Sun et al. used bioinfor-matics methods to identify possible protein-protein interac-tions between white spot syndrome virus (WSSV) and itsshrimphost.Their findings provide certain insights to readersin the relevant fields. In “MPINet: metabolite pathway identifi-cation via coupling of global metabolite network structure andmetabolomic profile,” F. Li et al. demonstrate a network-basedmetabolite pathway identification method, which identifiesnovel pathways related to disease.

In clinical medicine, length of stay (LOS) in the intensivecare unit (ICU) of spontaneous intracerebral hemorrhage(sICH) patients is one of the most important issues. C.-L. Chan et al. showed that the threshold of a prolongedICU stay is a good indicator of hospital utilization in ICHpatients.This indicator can be improved using quality controlmethods such as complications prevention and efficiency ofICU bed management. Patients’ stay in ICUs and in hospitalswill be shorter if integrated care systems are established. In“Ultrasonographic fetal growth charts: an informatic approachby quantitative analysis of the impact of ethnicity on diagnosesbased on a preliminary report on Salentinian population,”A. Tinelli et al. provide customized fetal growth charts toformulate an accurate fetal assessment and to avoid unnec-essary obstetric interventions. The fetal growth assessmentis crucial to the health of newborns. Results suggest acareful reexamination for the appropriateness of continueduse of currently adopted reference growth curves to classifyneonates.

This special issue presents novel applications or method-ologies of biomedical or bioinformatics analysis.The selected

BioMed Research International 3

articles show the importance of integrating diverse ideas andmultidisciplinary knowledge to answer complex biologicalquestions through high-throughput technologies.

Acknowledgments

We would like to thank the authors for their scientific contri-bution to this special issue. Also the anonymous reviewers aregreatly appreciated for their critical comments that improvethe quality of the papers published in this special issue.

Julia Tzu-Ya WengLi-Ching Wu

Wen-Chi ChangTzu-Hao ChangTatsuya AkutsuTzong-Yi Lee

Submit your manuscripts athttp://www.hindawi.com

Hindawi Publishing Corporationhttp://www.hindawi.com Volume 2014

Anatomy Research International

PeptidesInternational Journal of


Hindawi Publishing Corporation http://www.hindawi.com

International Journal of

Volume 2014

Zoology


Molecular Biology International

GenomicsInternational Journal of


The Scientific World JournalHindawi Publishing Corporation http://www.hindawi.com Volume 2014


BioinformaticsAdvances in

Marine BiologyJournal of



Signal TransductionJournal of


BioMed Research International

Evolutionary BiologyInternational Journal of



Biochemistry Research International

ArchaeaHindawi Publishing Corporationhttp://www.hindawi.com Volume 2014


Genetics Research International


Advances in

Virolog y

Hindawi Publishing Corporationhttp://www.hindawi.com

Nucleic AcidsJournal of

Volume 2014

Stem CellsInternational



Enzyme Research


International Journal of

Microbiology

Date post:	07-Jun-2020
Category:	Documents
Upload:	others
View:	0 times
Download:	0 times

Editorial Novel Bioinformatics Approaches for Analysis of High-Throughput Biological...

Documents