Type: | Package |
Title: | Annotate the Gene Symbols for Probes in Expression Array |
Version: | 0.1.7 |
Date: | 2022-11-12 |
Maintainer: | Yonghe Xia <xiayh17@gmail.com> |
Description: | We curated 147 of expression array, from 3 species(human,mouse,rat), 3 companies('Affymetrix','Illumina','Agilent'), by aligning the 'Fasta' sequences of all probes of each platform to their corresponding reference genome, and then annotate them to genes. |
License: | Apache License (≥ 2) |
Encoding: | UTF-8 |
URL: | https://github.com/jmzeng1314/AnnoProbe |
LazyData: | true |
RoxygenNote: | 7.1.1 |
Depends: | R (≥ 3.4.0) |
Imports: | ggplot2, DT, ggpubr, pheatmap, utils, methods, Biobase, stats, xml2, httr, curl |
biocViews: | DataImport, Microarray, OneChannel, SAGE, Software, TwoChannel |
Suggests: | limma, GEOquery, knitr, rmarkdown |
NeedsCompilation: | no |
Packaged: | 2022-11-12 11:39:35 UTC; xiayh |
Author: | Jianming Zeng [aut], Yujia Xiang [aut], Yonghe Xia [ctb, cre] |
Repository: | CRAN |
Date/Publication: | 2022-11-14 08:30:11 UTC |
An example dataset
Description
A dataset containing genes_expr, group_list, DEG
Usage
GSE27533
Format
A list with 6 elements:
- genes_expr
genes_expr, genes_expr
- DEG
DEG, DEG
...
An example dataset
Description
A dataset containing eSet, probes_expr, probe2gene, genes_expr, group_list, DEG
Usage
GSE95166
Format
A list with 6 elements:
- probes_expr
probes_expr, probes_expr
- probe2gene
probe2gene, probe2gene
- genes_expr
genes_expr, genes_expr
- group_list
group_list, group_list
- DEG
DEG, DEG
...
Annotate gene IDs according to GTF files in gencode
Description
annoGene will return a data.frame of gene information or write them to a file (csv or html format). The user should set a list of genes to be annotated, with "ENSEMBL" or "SYMBOL" style.
Usage
annoGene(IDs, ID_type, species = "human", out_file)
Arguments
IDs |
a list of genes |
ID_type |
the type of input IDs, should be "ENSEMBL" or "SYMBOL" |
species |
choose human or mouse, or rat, default: human |
out_file |
the filename, should be ".csv" or ".html". |
Value
a dataframe which columns contain genesymbol, biotypes, ensembl ids and the positions of genes
Examples
IDs <- c("DDX11L1", "MIR6859-1", "OR4G4P", "OR4F5")
ID_type = "SYMBOL"
annoGene(IDs, ID_type)
annoGene(IDs, ID_type,out_file = tempfile(fileext = ".html"))
annoGene(IDs, ID_type,out_file = tempfile(fileext = ".csv"))
Check whether the input gpl in our platform list or not
Description
Check whether the input gpl in our platform list or not
Usage
checkGPL(GPL = NULL)
Arguments
GPL |
GPL(GEO platform) number, eg: GPL570 |
Value
returns a boolean value
Examples
checkGPL('GPL570')
checkGPL('GPL15314')
checkGPL('GPL10558')
Check a list of genes how they show difference.
Description
How does a gene or a list of genes show difference between two group. The boxplot or heatmap will be drawed. just a wrap function of ggpubr and pheatmap.
Usage
check_diff_genes(gene, genes_expr, group_list)
Arguments
gene |
A vector contains all gene ids of interest. Gene ids should be gene symbol. |
genes_expr |
An expression matrix, the rownames should be gene symbol. |
group_list |
A vector contains the group information of each samples in expression matrix |
Value
A figure : boxplot or heatmap
Examples
attach(GSE95166)
check_diff_genes('LRCH3',genes_expr,group_list )
x=DEG$logFC
names(x)=rownames(DEG)
cg=c(names(head(sort(x),100)), names(tail(sort(x),100)))
check_diff_genes(cg,genes_expr,group_list )
draw a heatmap for DEG result
Description
deg_heatmap
will draw a heatmap for you.
Usage
deg_heatmap(deg, genes_expr, group_list, topn = 20)
Arguments
deg |
the result from limma. |
genes_expr |
the expression matrix |
group_list |
a vector |
topn |
the number of genes in heatmap, default:20 |
Value
a ggplot2 style figure.
Examples
attach(GSE27533)
deg_heatmap(DEG,genes_expr,group_list)
draw a volcano for DEG result
Description
deg_volcano
will draw a volcano for you.
Usage
deg_volcano(need_deg, style = 1, p_thred = 0.05, logFC_thred = 1)
Arguments
need_deg |
should be 3 columns : gene, logFC, p.value(or p.adjust |
style |
you can try 1 or 2, default: 1 |
p_thred |
default:0.05 |
logFC_thred |
default:1 |
Value
a ggplot2 style figure.
Examples
deg=GSE27533$DEG
need_deg=data.frame(symbols=rownames(deg), logFC=deg$logFC, p=deg$P.Value)
deg_volcano(need_deg,2)
deg_volcano(need_deg,1)
Filter expression matrix based on annotation
Description
filterEM
will annotate the probes in expression matrix and remove the duplicated gene symbols.
because there will be many probes mapped to same genes, we will only keep the max value one.
Usage
filterEM(probes_expr, probe2gene)
Arguments
probes_expr |
is an expression matrix which rownames are probes of probe2gene and each column is a sample |
probe2gene |
the first column is probes and the second column is corresponding gene symbols |
Value
a expression matrix which has been filtered duplicated gene symbols
Examples
attach(GSE95166)
# head(probes_expr)
# head(probe2gene)
genes_expr <- filterEM(probes_expr,probe2gene)
# head(genes_expr)
Download expression dataset by GSE id
Description
geoChina
will download the expression matrix and phenotype data as ExpressionSet format
from cloud in mainland China,
it's a alternative method for getGEO function from GEOquery package.
geoChina('gse1009') is the same as eSet=getGEO('gse1009', getGPL = F)
Usage
geoChina(gse = "GSE2546", mirror = "tencent", destdir = getwd())
Arguments
gse |
input GSE id, such as GSE1009, GSE2546, gse1009. |
mirror |
"tencent" only for now. |
destdir |
The destination directory for data downloads. |
Value
a list of ExpressionSet, which contains the expression matrix and phenotype data
Examples
## Not run:
geoChina('GSE1009',destdir=tempdir())
## End(Not run)
Get all GPL list in our package
getGPLList
returns all the GPL number checklist stored in package
Description
Get all GPL list in our package
getGPLList
returns all the GPL number checklist stored in package
Usage
getGPLList()
Value
a data.frame which contains the gpl and name of array.
Get Probe Annotation
Description
idmap
returns probe annotations for input gpl
Usage
idmap(gpl = "GPL570", type = "bioc", mirror = "tencent", destdir = getwd())
Arguments
gpl |
GPL(GEO platform) number, eg: GPL570 |
type |
source of probe anntation stored, one of "pipe", "bioc", "soft", default:"pipe" |
mirror |
"tencent" only for now |
destdir |
The destination directory for data downloads. |
Value
probe annotaions
Examples
ids=idmap('GPL570',destdir=tempdir())
ids=idmap('GPL570',type='soft',destdir=tempdir())
ids=idmap('GPL18084',type='pipe',destdir=tempdir())
Print GPL information
Description
Print GPL information
Usage
printGPLInfo(GPL = NULL)
Arguments
GPL |
GPL(GEO platform) number, eg: GPL570 |
Value
print detail information of the input GEO platform
Examples
printGPLInfo('GPL93')