Type: Package
Title: Ensemble Based Machine Learning Approach for Predicting Methylation States
Version: 0.1.1
Author: Dipro Sinha [aut, cre], Sunil Archak [aut], Dwijesh Chandra Mishra [aut], Tanwy Dasmandal [aut], Md Yeasin [aut]
Maintainer: Dipro Sinha <diprosinha@gmail.com>
Description: DNA methylation (6mA) is a major epigenetic process by which alteration in gene expression took place without changing the DNA sequence. Predicting these sites in-vitro is laborious, time consuming as well as costly. This 'EpiSemble' package is an in-silico pipeline for predicting DNA sequences containing the 6mA sites. It uses an ensemble-based machine learning approach by combining Support Vector Machine (SVM), Random Forest (RF) and Gradient Boosting approach to predict the sequences with 6mA sites in it. This package has been developed by using the concept of Chen et al. (2019) <doi:10.1093/bioinformatics/btz015>.
License: GPL-3
Encoding: UTF-8
RoxygenNote: 7.2.0
Imports: stats, devtools, tidyverse, seqinr, Biostrings, splitstackshape, entropy, party, stringr, tibble, doParallel, parallel, e1071, caret, randomForest, gbm, foreach, ftrCOOL, iterators
NeedsCompilation: no
Packaged: 2023-06-02 16:06:43 UTC; YEASIN
Repository: CRAN
Date/Publication: 2023-06-04 14:20:02 UTC

Important Features

Description

Find out the most suitable features for predicting sequences with 6mA sites.

Usage

ImpFeatures(Fastafile,Species)

Arguments

Fastafile

Sequence file in .fasta format

Species

Model organism

Value

test_data_input: A matrix containing important features for prediction

References

Chen, W., Lv, H., Nie, F., & Lin, H. (2019). i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome. Bioinformatics, 35(16), 2796-2800.

Examples


library(EpiSemble)
data<-system.file("exdata/test.fasta", package = "EpiSemble")
imp<-ImpFeatures(Fastafile=data, Species="Rice")


Epigenetic Modification Prediction

Description

Predicting sequences with 6mA sites.

Usage

epiPred(FastaData,Species)

Arguments

FastaData

Sequence file (.fasta format)

Species

Model organism

Value

MethStatus: Sequences with their methylation state (methylated or non-methylated)

References

Chen, W., Lv, H., Nie, F., & Lin, H. (2019). i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome. Bioinformatics, 35(16), 2796-2800.

Examples


library(EpiSemble)
data<-system.file("exdata/test.fasta", package = "EpiSemble")
pred<-epiPred(FastaData=data, Species="Rice")

mirror server hosted at Truenetwork, Russian Federation.