Abstract

Accumulated evidence suggests that the widely expressed long-non-coding RNAs (lncRNAs) are involved in biogenesis. Some aberrant lncRNAs are closely related to pathological changes, for instance, in cancer. Both in tumorigenesis and cancer progression, depending on the interplay with cellular molecules, lncRNAs can modulate transcriptional interference, chromatin remodeling, post-translational regulation and protein modification, and further interfere with signaling pathways. Aiming to the diagnosis/ prognosis markers or potential therapeutical targets, it is important to figure out the specific mechanism and the tissue-specific expressing patterns of lncRNAs. Generally, the bioinformatics analysis is the first step. More and more in silico databases are increasing. But the existing integrative online platforms’ functions are not only having their unique features but also share some common features, which may lead to a waste of time for researchers. Here, we reviewed these web tools according to the functions. For each database, we clarified the data source, analysis method and the evidence that the analysis result is derived from. This review also illustrated examples in practical use for a specific lncRNA by these web tools. It will provide convenience for researchers to quickly choose the appropriate bioinformatics web tools in oncology studies.

Introduction

Long-non-coding RNAs (lncRNAs) are a heterogeneous category of non-coding RNAs containing microRNA, snoRNA, rRNA, tRNA and so on, and which are defined as RNAs longer than 200 nucleotides lacking coding potential (1, 2).

Emerging studies continuously revealed lncRNAs involving in the biological processes (3, 4). By interacting with specific gene sequences, targeting mRNAs or proteins, lncRNAs work as signal, decoy, scaffold or guide and function in transcriptional interference, chromatin remodeling, post-translational regulation and protein modification (5). Deregulated lncRNAs participate in the occurrence and development of diseases, especially in tumors. The important roles of lncRNAs have been verified in tumorigenesis, such as genome instability (6–8), promoting proliferation (9), evading growth suppressors (10), resisting apoptosis (11) and angiogenesis (12). In tumor progression, lncRNAs have also been found to regulate several steps of metastasis (13–15), such as cell invasion, migration and avoidance of immune destruction (16). Some lncRNAs are tissue- or cancer-type-specific expression, which may contribute to the diagnosis (17), survival prediction (18) and provide potential therapy targets (19).

Some lncRNAs are screened by high-throughput sequencing (20, 21). In general, to investigate the roles and mechanisms of lncRNAs, studies estimate the expression differences and predicate localization and interaction targets. To analyze clinical relevance, survival analysis and prognostic assessment have been applied to specific cancer. Afterwards, in vitro and in vivo experiments are designed to perform validation. In most of these studies, bioinformatics analysis is the first step. With the contribution of increasing high-throughput sequencing data and the opening source of clinical information, more and more in silico databases are arising. Meanwhile, manually curated databases of experimental lncRNAs are also emerging (Figure 1). Some webtools for lncRNAs analyses are multifunctional, interactive and user-friendly and less time-consuming especially for researchers who lack programming capability. They can get novel or previously unannotated lncRNAs to guide their further studies.

Timeline of lncRNA-related databases.
Figure 1.

Timeline of lncRNA-related databases.

However, existing integrative online platforms’ functions are partially overlapped and have their own features, which make it difficult for users to choose. Here, we reviewed these web tools according to the functions and clearly displayed the functions by tables, including both basic information (Table 1) and special features (Table 2), providing convenience for researchers to quickly choose the right one for their further studies.

Table 1.

Basic information of reviewed LncRNA analysis web tools

Contents NameData sourcesQuoted rate
(Until 2021)
URLDOICountryThe first release time
The update time
General platformslncRNomeGencode Release, Ensembl, HGNC, NCBI88http://genome.igib.res.in/lncRNome10.1093/database/bat034India2013
LNCipediaLncRNAdb, Broad Institute, Ensembl, Gencode, Refseq, NONCODE, FANTOM402https://lncipedia.org10.1093/nar/gks915
10.1093/nar/gky1031
Belgium2013
2018
NONCODEEnsembl, RefSeq, lncRNAdb, Incipedia, GenBank1006http://www.noncode.org/10.1093/nar/gki041
10.1093/nar/gkx1107
China2004
2015
2018
2020
LncBookGENCODE, NONCODE, LNCipedia, MiTranscriptome beta39http://bigd.big.ac.cn/lncbook10.1093/nar/gky960China2019
AnnoLncCGHub, GENCONE, LncRNAdb, UCSC, GO, GEO28http://annolnc.gao-lab.org/10.1186/s12864-016-3287-9
10.1093/nar/gkaa368
China2016
2020
LncExpDBGENCODE, NONCODE, LNCipedia, RefLnc, LncBook, FANTOM-CAT2https://bigd.big.ac.cn/lncexpdb10.1093/nar/gkaa850China2020
LncSEACistrome, NCBI, TCGA, ENCODE, LncMap, Lnc2Cancer2.0, LncRNADisease 2.0, GTRD1http://bio.liclab.net/LncSEA/index.php10.1093/nar/gkaa806China2020
Specific function analysisLncRNASNP2TCGA, COSMIC57http://bioinfo.life.hust.edu.cn/lncRNASNP#!/10.1093/nar/gku1000
10.1093/nar/gkx1004
China2014
2017
LncRNA2TargetPubmed, GEO198http://123.59.132.21/lncrna2target/index.jsp10.1093/nar/gku1173
10.1093/nar/gky1051
China2015
2018
LncLocatorRNALocate79http://www.csbio.sjtu.edu.cn/bioinf/lncLocator/10.1093/bioinformatics/bty085China, Netherlands2018
LncTarDTCGA, Pubmed9http://bio-bigdata.hrbmu.edu.cn/LncTarD/index.jsp10.1093/nar/gkz985China2019
Clinical significance analysis and predictionLncRNADiseasePubmed624http://cmbi.bjmu.edu.cn/lncrnadisease10.1093/nar/gks1099
10.1093/nar/gky905
China2012
2018
TANRICCouchDB274https://ibl.mdanderson.org/tanric/_design/basic/main.html10.1158/0008-5472.Can-15-0273United States2015
Lnc2CancerPubmed, TCGA, GEO, Ensembl, RefSeq, NONCODE, COSMIC, OMIM226http://www.bio-bigdata.net/lnc2cancer10.1093/nar/gkv1094
10.1093/nar/gkaa1006
China2015
2020
Lnc2CatlasGENCODE, TCGA, dbSNP, MalaCards, DisGeNET, Pubmed11https://lnc2catlas.bioinfotech.org/home/10.1038/s41598-018-20232-4China2018
LnCARAffymetrix, Aglient, Illumina8https://lncar.renlab.org/10.1158/0008-5472.Can-18-2169China2019
LncR2MetastaPubmed, Enterz, EnsemblN/Ahttp://lncr2metasta.wchoda.com/search10.1093/bib/bbaa178China, Netherlands2020
ncRNA-eQTLTCGA, GWAS, GENCODE6http://ibi.hzau.edu.cn/ncRNA-eQTL/10.1093/nar/gkz711China2020
Contents NameData sourcesQuoted rate
(Until 2021)
URLDOICountryThe first release time
The update time
General platformslncRNomeGencode Release, Ensembl, HGNC, NCBI88http://genome.igib.res.in/lncRNome10.1093/database/bat034India2013
LNCipediaLncRNAdb, Broad Institute, Ensembl, Gencode, Refseq, NONCODE, FANTOM402https://lncipedia.org10.1093/nar/gks915
10.1093/nar/gky1031
Belgium2013
2018
NONCODEEnsembl, RefSeq, lncRNAdb, Incipedia, GenBank1006http://www.noncode.org/10.1093/nar/gki041
10.1093/nar/gkx1107
China2004
2015
2018
2020
LncBookGENCODE, NONCODE, LNCipedia, MiTranscriptome beta39http://bigd.big.ac.cn/lncbook10.1093/nar/gky960China2019
AnnoLncCGHub, GENCONE, LncRNAdb, UCSC, GO, GEO28http://annolnc.gao-lab.org/10.1186/s12864-016-3287-9
10.1093/nar/gkaa368
China2016
2020
LncExpDBGENCODE, NONCODE, LNCipedia, RefLnc, LncBook, FANTOM-CAT2https://bigd.big.ac.cn/lncexpdb10.1093/nar/gkaa850China2020
LncSEACistrome, NCBI, TCGA, ENCODE, LncMap, Lnc2Cancer2.0, LncRNADisease 2.0, GTRD1http://bio.liclab.net/LncSEA/index.php10.1093/nar/gkaa806China2020
Specific function analysisLncRNASNP2TCGA, COSMIC57http://bioinfo.life.hust.edu.cn/lncRNASNP#!/10.1093/nar/gku1000
10.1093/nar/gkx1004
China2014
2017
LncRNA2TargetPubmed, GEO198http://123.59.132.21/lncrna2target/index.jsp10.1093/nar/gku1173
10.1093/nar/gky1051
China2015
2018
LncLocatorRNALocate79http://www.csbio.sjtu.edu.cn/bioinf/lncLocator/10.1093/bioinformatics/bty085China, Netherlands2018
LncTarDTCGA, Pubmed9http://bio-bigdata.hrbmu.edu.cn/LncTarD/index.jsp10.1093/nar/gkz985China2019
Clinical significance analysis and predictionLncRNADiseasePubmed624http://cmbi.bjmu.edu.cn/lncrnadisease10.1093/nar/gks1099
10.1093/nar/gky905
China2012
2018
TANRICCouchDB274https://ibl.mdanderson.org/tanric/_design/basic/main.html10.1158/0008-5472.Can-15-0273United States2015
Lnc2CancerPubmed, TCGA, GEO, Ensembl, RefSeq, NONCODE, COSMIC, OMIM226http://www.bio-bigdata.net/lnc2cancer10.1093/nar/gkv1094
10.1093/nar/gkaa1006
China2015
2020
Lnc2CatlasGENCODE, TCGA, dbSNP, MalaCards, DisGeNET, Pubmed11https://lnc2catlas.bioinfotech.org/home/10.1038/s41598-018-20232-4China2018
LnCARAffymetrix, Aglient, Illumina8https://lncar.renlab.org/10.1158/0008-5472.Can-18-2169China2019
LncR2MetastaPubmed, Enterz, EnsemblN/Ahttp://lncr2metasta.wchoda.com/search10.1093/bib/bbaa178China, Netherlands2020
ncRNA-eQTLTCGA, GWAS, GENCODE6http://ibi.hzau.edu.cn/ncRNA-eQTL/10.1093/nar/gkz711China2020
Table 1.

Basic information of reviewed LncRNA analysis web tools

Contents NameData sourcesQuoted rate
(Until 2021)
URLDOICountryThe first release time
The update time
General platformslncRNomeGencode Release, Ensembl, HGNC, NCBI88http://genome.igib.res.in/lncRNome10.1093/database/bat034India2013
LNCipediaLncRNAdb, Broad Institute, Ensembl, Gencode, Refseq, NONCODE, FANTOM402https://lncipedia.org10.1093/nar/gks915
10.1093/nar/gky1031
Belgium2013
2018
NONCODEEnsembl, RefSeq, lncRNAdb, Incipedia, GenBank1006http://www.noncode.org/10.1093/nar/gki041
10.1093/nar/gkx1107
China2004
2015
2018
2020
LncBookGENCODE, NONCODE, LNCipedia, MiTranscriptome beta39http://bigd.big.ac.cn/lncbook10.1093/nar/gky960China2019
AnnoLncCGHub, GENCONE, LncRNAdb, UCSC, GO, GEO28http://annolnc.gao-lab.org/10.1186/s12864-016-3287-9
10.1093/nar/gkaa368
China2016
2020
LncExpDBGENCODE, NONCODE, LNCipedia, RefLnc, LncBook, FANTOM-CAT2https://bigd.big.ac.cn/lncexpdb10.1093/nar/gkaa850China2020
LncSEACistrome, NCBI, TCGA, ENCODE, LncMap, Lnc2Cancer2.0, LncRNADisease 2.0, GTRD1http://bio.liclab.net/LncSEA/index.php10.1093/nar/gkaa806China2020
Specific function analysisLncRNASNP2TCGA, COSMIC57http://bioinfo.life.hust.edu.cn/lncRNASNP#!/10.1093/nar/gku1000
10.1093/nar/gkx1004
China2014
2017
LncRNA2TargetPubmed, GEO198http://123.59.132.21/lncrna2target/index.jsp10.1093/nar/gku1173
10.1093/nar/gky1051
China2015
2018
LncLocatorRNALocate79http://www.csbio.sjtu.edu.cn/bioinf/lncLocator/10.1093/bioinformatics/bty085China, Netherlands2018
LncTarDTCGA, Pubmed9http://bio-bigdata.hrbmu.edu.cn/LncTarD/index.jsp10.1093/nar/gkz985China2019
Clinical significance analysis and predictionLncRNADiseasePubmed624http://cmbi.bjmu.edu.cn/lncrnadisease10.1093/nar/gks1099
10.1093/nar/gky905
China2012
2018
TANRICCouchDB274https://ibl.mdanderson.org/tanric/_design/basic/main.html10.1158/0008-5472.Can-15-0273United States2015
Lnc2CancerPubmed, TCGA, GEO, Ensembl, RefSeq, NONCODE, COSMIC, OMIM226http://www.bio-bigdata.net/lnc2cancer10.1093/nar/gkv1094
10.1093/nar/gkaa1006
China2015
2020
Lnc2CatlasGENCODE, TCGA, dbSNP, MalaCards, DisGeNET, Pubmed11https://lnc2catlas.bioinfotech.org/home/10.1038/s41598-018-20232-4China2018
LnCARAffymetrix, Aglient, Illumina8https://lncar.renlab.org/10.1158/0008-5472.Can-18-2169China2019
LncR2MetastaPubmed, Enterz, EnsemblN/Ahttp://lncr2metasta.wchoda.com/search10.1093/bib/bbaa178China, Netherlands2020
ncRNA-eQTLTCGA, GWAS, GENCODE6http://ibi.hzau.edu.cn/ncRNA-eQTL/10.1093/nar/gkz711China2020
Contents NameData sourcesQuoted rate
(Until 2021)
URLDOICountryThe first release time
The update time
General platformslncRNomeGencode Release, Ensembl, HGNC, NCBI88http://genome.igib.res.in/lncRNome10.1093/database/bat034India2013
LNCipediaLncRNAdb, Broad Institute, Ensembl, Gencode, Refseq, NONCODE, FANTOM402https://lncipedia.org10.1093/nar/gks915
10.1093/nar/gky1031
Belgium2013
2018
NONCODEEnsembl, RefSeq, lncRNAdb, Incipedia, GenBank1006http://www.noncode.org/10.1093/nar/gki041
10.1093/nar/gkx1107
China2004
2015
2018
2020
LncBookGENCODE, NONCODE, LNCipedia, MiTranscriptome beta39http://bigd.big.ac.cn/lncbook10.1093/nar/gky960China2019
AnnoLncCGHub, GENCONE, LncRNAdb, UCSC, GO, GEO28http://annolnc.gao-lab.org/10.1186/s12864-016-3287-9
10.1093/nar/gkaa368
China2016
2020
LncExpDBGENCODE, NONCODE, LNCipedia, RefLnc, LncBook, FANTOM-CAT2https://bigd.big.ac.cn/lncexpdb10.1093/nar/gkaa850China2020
LncSEACistrome, NCBI, TCGA, ENCODE, LncMap, Lnc2Cancer2.0, LncRNADisease 2.0, GTRD1http://bio.liclab.net/LncSEA/index.php10.1093/nar/gkaa806China2020
Specific function analysisLncRNASNP2TCGA, COSMIC57http://bioinfo.life.hust.edu.cn/lncRNASNP#!/10.1093/nar/gku1000
10.1093/nar/gkx1004
China2014
2017
LncRNA2TargetPubmed, GEO198http://123.59.132.21/lncrna2target/index.jsp10.1093/nar/gku1173
10.1093/nar/gky1051
China2015
2018
LncLocatorRNALocate79http://www.csbio.sjtu.edu.cn/bioinf/lncLocator/10.1093/bioinformatics/bty085China, Netherlands2018
LncTarDTCGA, Pubmed9http://bio-bigdata.hrbmu.edu.cn/LncTarD/index.jsp10.1093/nar/gkz985China2019
Clinical significance analysis and predictionLncRNADiseasePubmed624http://cmbi.bjmu.edu.cn/lncrnadisease10.1093/nar/gks1099
10.1093/nar/gky905
China2012
2018
TANRICCouchDB274https://ibl.mdanderson.org/tanric/_design/basic/main.html10.1158/0008-5472.Can-15-0273United States2015
Lnc2CancerPubmed, TCGA, GEO, Ensembl, RefSeq, NONCODE, COSMIC, OMIM226http://www.bio-bigdata.net/lnc2cancer10.1093/nar/gkv1094
10.1093/nar/gkaa1006
China2015
2020
Lnc2CatlasGENCODE, TCGA, dbSNP, MalaCards, DisGeNET, Pubmed11https://lnc2catlas.bioinfotech.org/home/10.1038/s41598-018-20232-4China2018
LnCARAffymetrix, Aglient, Illumina8https://lncar.renlab.org/10.1158/0008-5472.Can-18-2169China2019
LncR2MetastaPubmed, Enterz, EnsemblN/Ahttp://lncr2metasta.wchoda.com/search10.1093/bib/bbaa178China, Netherlands2020
ncRNA-eQTLTCGA, GWAS, GENCODE6http://ibi.hzau.edu.cn/ncRNA-eQTL/10.1093/nar/gkz711China2020
Table 2.

Function characteristics of LncRNA analysis web tools

Sequence informationMolecular functionClinical relevance
NameSequence analysisTranscript informationConservativeSecondary structureMethylationSNPCoding potentialSubcell locationGene interactionmRNA interactionmiRNA interactionProtein interactionSignal pathwayExpression differenceExpression profilesVariationSurvival analysisCancer stageMetastasisDisease-related lncRNADrug
lncRNome
LNCipedia
NONCODE
LncBook
AnnoLnc
LncExpDB
LncSEA
LncRNASNP
LncRNA2Target
LncLocator
LncTarD
LncRNADisease
TANRIC
Lnc2Cancer
Lnc2Catlas
LnCAR
LncR2Metasta
ncRNA-eQTL
Sequence informationMolecular functionClinical relevance
NameSequence analysisTranscript informationConservativeSecondary structureMethylationSNPCoding potentialSubcell locationGene interactionmRNA interactionmiRNA interactionProtein interactionSignal pathwayExpression differenceExpression profilesVariationSurvival analysisCancer stageMetastasisDisease-related lncRNADrug
lncRNome
LNCipedia
NONCODE
LncBook
AnnoLnc
LncExpDB
LncSEA
LncRNASNP
LncRNA2Target
LncLocator
LncTarD
LncRNADisease
TANRIC
Lnc2Cancer
Lnc2Catlas
LnCAR
LncR2Metasta
ncRNA-eQTL
Table 2.

Function characteristics of LncRNA analysis web tools

Sequence informationMolecular functionClinical relevance
NameSequence analysisTranscript informationConservativeSecondary structureMethylationSNPCoding potentialSubcell locationGene interactionmRNA interactionmiRNA interactionProtein interactionSignal pathwayExpression differenceExpression profilesVariationSurvival analysisCancer stageMetastasisDisease-related lncRNADrug
lncRNome
LNCipedia
NONCODE
LncBook
AnnoLnc
LncExpDB
LncSEA
LncRNASNP
LncRNA2Target
LncLocator
LncTarD
LncRNADisease
TANRIC
Lnc2Cancer
Lnc2Catlas
LnCAR
LncR2Metasta
ncRNA-eQTL
Sequence informationMolecular functionClinical relevance
NameSequence analysisTranscript informationConservativeSecondary structureMethylationSNPCoding potentialSubcell locationGene interactionmRNA interactionmiRNA interactionProtein interactionSignal pathwayExpression differenceExpression profilesVariationSurvival analysisCancer stageMetastasisDisease-related lncRNADrug
lncRNome
LNCipedia
NONCODE
LncBook
AnnoLnc
LncExpDB
LncSEA
LncRNASNP
LncRNA2Target
LncLocator
LncTarD
LncRNADisease
TANRIC
Lnc2Cancer
Lnc2Catlas
LnCAR
LncR2Metasta
ncRNA-eQTL

General platforms

lncRNome

lncRNome is a database released in 2013 by CSIR (Centre of Scientific and Industrial Research)-Institute of India genomics and Integrative Biology (22). Database statistics show 10 840 of lncRNA genes and 17 547 alternative splice variants. It contains the following sub-class datasets: 4491 RNA–protein interactions, 310 010 variations mapping to lncRNAs and more than 40 000 of epigenetic markers mapping around lncRNA TSS (transcription start site).

In terms of data reference and processing, some of the data in the ‘General’ module come from Gencode release 12 (23). Data such as Disease Association and PMID are derived from manually collated literature. LncRNA sequence and single nucleotide polymorphisms (SNPs) from UCSC (University of California, Santa Cruz) Genome Browser Database (24). RNA structures are calculated by using RNAfold and protein–RNA interactions that are partly based on experimental data sets and a computational prediction method for predicting the residues in RNA involving the support vector machine.

lncRNome provides (i) sequence information: including sequence analysis, basic transcript information, conservative, secondary structure, methylation, SNP, protein-coding potential and expression, (ii) molecular function: including lncRNA–miRNA interaction and lncRNA–protein interaction based on experimental data sets and calculations and (iii) clinical relevance: including variation and disease-related lncRNA.

In the ‘General’ module, researchers can query the expression levels of specific lncRNA in different cancer types and its correlation with other diseases. lncRNome is helpful in large-scale screening experiments, such as expression analysis of disease-associated lncRNAs. Claudia Cava et al. obtained 10 167 lncRNAs reported from lncRNome when they set up co-expression networks among miRNAs, lncRNAs and mRNAs in normal tissues (25). Kaili Luo et al. analyzed data from lncRNome along with the Atlas of Non-coding RNAs in Cancer (TANRIC) to identify novel lncRNAs potentially associated with colorectal cancer (CRC). LncRNA CASC9 was picked for further in-depth study, which was highly expressed and related to poor survival in colon cancer. It showed that CASA9 is a promising prognostic predictor and potential therapeutic target (26).

LNCipedia

LNCipedia (27) was first released in 2012 by the Center for Medical Genetics, Ghent University, Belgium, and then, it has been updated to LNCipedia 5 (28) in 2018. The total number of genes in the database is 56 946 (127 802 transcripts), and the high confidence set contains 49 372 genes (107 039 transcripts).

LncRNA data come from the following databases: LncRNAdb (29), Broad Institute (Human Body Map lincRNAs), Ensembl (30), GENCODE (31), etc. The developers used the RNAfold algorithm to generate the secondary structure and point diagrams with pairing probabilities and applied MirTarget2 (32) algorithm to predict miRNA seeds. The protein-coding potential is evaluated by using two algorithms, Coding-Potential Calculator and PRIDE database at EMBL-EBI (33).

LNCipedia provides (i) sequence information, including expression, sequence analysis, basic transcript information, conservative, secondary structure and protein-coding potential and (ii) molecular function: including miRNA seed prediction.

LNCipedia is a comprehensive database with multiple functions. Alberto Cedro-Tanda et al. annotated the data of HT-seq raw counts with LNCipedia to get lncRNAs and filter the low expressed row count data in their study of exploring lncRNA landscape in breast cancer. Data were further handled to portray lncRNAs in different subtypes of breast cancer. Subsequently, they chose lncRNA AC009283.1 to explore the functions in Human Epidermal Growth Factor Receptor 2-enriched breast cancer. Results showed that AC009283.1 regulates genes involving multiple biological progresses (34).

NONCODE

NONCODE was first released by the Chinese Academy of Sciences in 2004 (35) and the database has been upgraded several times by 2020, with the latest update, NONCODEV6 (14), released in November 2020. NONCODEV6 contains 644 510 lncRNA transcripts in 39 species, collecting 16 animals and 23 plants, with humans containing 173 112 lncRNA transcripts.

As for data collection, NONCODEv6-related lncRNA data are collected from the following databases: Ensembl (36), RefSeq (37), LncRNAdb (38), LNCipedia (28), CANTATAdb (39) and older versions of NONCODE. The developers collected 13 749 lncRNA records and disease-related information through documentation and mutation analysis of genome-wide association study (GWAS) data (40). NONCODEv6 summarizes lncRNA cancer-related information from six related databases [including: LncSpA (41), Lnc2Cancer (42), LncRNAWiki (43), LncTarD (44), MNDR (45) and LncRNADisease (46)] and the literature.

NONCODE provides (i) sequence information: including sequence analysis, basic transcript information, conservative, secondary structure, methylation, SNP, protein-coding potential and expression and (ii) clinical relevance: including variation, cancer cell lines expression profiles and disease-related lncRNAs.

NONCODE provides a user-friendly interface, supporting the collation of lncRNA-related diseases by collecting six human exosome data sets (six tumor cell lines and four tissues) from (The Gene Expression Omnibus) GEO (47) to support the collation of lncRNA-related diseases. In addition, users can query information about the SNP disease of interesting lncRNAs and view the expression spectrum of lncRNA in human tissue provided by the database.

Ching-Ching Yeh et al. explored the molecular mechanism of lncRNA NDRG1-OT1 on NDRG1. And they constructed six fragments according to the 2D structure of NDRG1-OT1 predicted by NONCODE. Furthermore, the results of luciferase assays with different fragments indicated that different fragments of NDRG1-OT1 had different impacts on the same target gene, NDRG1, in breast cancer cells under hypoxia (48).

LncBook

LncBook is a comprehensive lncRNA online analytics platform released by the Chinese Academy of Sciences in 2018 (49). It integrates multi-omics data from databases such as GENCODE v27 (31), NONCODE v5 (50), LNCipedia v4.1 (51), Mi-Transcriptome beta (1) and LncRNAWiki (43) in a structured manner and obtains 27 044 non-redundant lncRNA transcripts which belong to 140 362 gene sites.

To perform functional analysis, LncBook uses three algorithms to predict lncRNA-coding potential: (Laboratory of the Government Chemist) LGC, (53) Coding-Potential Assessment Tool (CPAT) (52) and PLEK (predictor of long non-coding RNAs and messenger RNAs based on an improved k-mer scheme) (53); TargetScan (54) and miRanda (55) to predict the interactions between lncRNA and miRNA; identify new lncRNAs with Human Protein Atlas (56), FastaQC, Trimmomatic (57), Hisat2 (58), StringTie (55) and CuffCompare (59). All the analyses can be easily applied by browsing or lncRNA ID/sequence searching.

LncBook provides (i) sequence information, including, sequence analysis, basic transcript information, methylation, SNP, expression and protein-coding potential prediction, (ii) molecular function, including interactions between lncRNA and miRNA and (iii) clinical relevance, including differential expression between cancer and normal tissues, genomic variation and disease-related lncRNAs.

In the ‘Diseases’ module, the platform describes each lncRNA with symbol, dysfunction type, detailed description, disease name, MeSH Ontology and PMID. LncBook includes 3772 experimentally supported disease-associated lncRNAs, and 9798 lncRNAs may be associated with diseases through methylation, genomic variation and lncRNA–miRNA interaction predictions (49). For each lncRNA, LncBook provides methylation levels for the promoter and body region ofnormal tissues and cancer samples of nine cancers, and these things are summarized in tables and visualized expression difference in a chart. In LncBook’s cancer prediction mechanism, the algorithm suggests that a lncRNA is associated with cancer if high or low methylation persists in at least eight types of cancer.

In practice, to research the molecular mechanism of LINC00673 in breast cancer, Kun Qiao et al. used Lncbook to explore potential miRNAs that may be targeted by LINC00673 and MARK4.258 miRNAs were identified and of which 40 miRNAs that had been claimed to be associated with cancers were selected for further study. The expression of 18 miRNAs was distinctly increased along with the knockdown of LINC00673. Then, miR-515-5p was chose to verify that it could bind to LINC00673 and MARK4 in breast cancer cells (60).

AnnoLnc2

AnnoLnc was first released by Peking University in China in 2016 (61) and updated to AnnoLnc2 (62) in 2020. Developers collected information about lncRNA expression, subcellular location, SNP site, etc., from databases such as CGHub (63), GENCODE, LncRNAdb v2 (38), UCSC (24), gene ontology (GO) and GEO (47). The LncRNA–protein interaction is based on experimental support and sequence prediction.

AnnoLnc2 provides (i) sequence information: including sequence analysis, basic transcript information, conservative, secondary structure, SNP, coding potential and co-expression, (ii) molecular function: including subcellular location, prediction lncRNA–miRNA interaction and lncRNA–protein interaction and (iii) clinical relevance: including variation, expression difference between cancer and normal tissues, cell line expression profiles and disease-related lncRNAs.

In general, while querying the database with lncRNA sequence in Fasta format, the detailed information about the lncRNA can be downloaded or browsed online including its structure, location, expression, interactions and so on. These data can be analyzed and applied to further studies.

AnnoLnc2 is a one-stop portal to annotate novel lncRNAs with multiple functions. The function of predicting lncRNA–miRNA interaction was enrolled in their published studies (64, 65). In the research about lncRNA SNHG6 in CRC, AnnoLnc2 was utilized to compare SNHG6 expression levels in different cancers. Data showed that the expression of SNHG6 was relatively high in CRC compared with that of many other cancers (66). Meanwhile, Mingliang Liu et al. used AnnoLnc2 to predict the secondary structure of lncRNA CF129, which was further validated stabilization of p53 protein in pancreatic cancer progression (67).

LncExpDB

LncExpDB was released by China’s National Bio-Information Center in 2020 (68). The database contains the expression of the lncRNA genes from 101 293 individuals in 337 biological environments and 1977 samples under nine biological conditions. In addition, the developers identified 25 191 signature genes and obtained more than 28.44 million lncRNA–mRNA interactions.

In terms of data sources and processing, LncExpDB-integrated lncRNA transcripts from databases such as LncBook v1.2 (49), RefLnc (69), GENCODE V33 (70), CHESS v2.2 (71) and FANTOM-CAT (72). The developers used four algorithms [LGC, CPC2 (73), CPAT (52) and PLEK (53)] to make the coding potential and the co-expression network to predict lncRNA–mRNA interactions.

LncExpDB provides (i) sequence information: including sequence analysis, basic transcript information and coding potential, (ii) molecular function: including subcellular localization and lncRNA–mRNA interaction and (iii) clinical relevance: including differential expression between cancer and normal tissues, cancer cell lines expression profiles and disease-related lncRNAs.

Expression capacity refers to the differential expression of characteristic lncRNA genes in specific cell line, tissue-specific tables, exosomes, cancers or viral infections. On the interface, users can browse the expressions of interested lncRNAs in nine different biological contexts, including cancer tissue/cell lines, normal tissue/cell line, exosome, cell differentiation, subcellular localization, preimplantation embryo, organ development, virus infection and circadian rhythm. Based on a large amount of RNA-seq data, a total of 2907 cancer cells were identified, and oncologists were able to estimate the expression of the lncRNA’s biological function in cancer cells (68). In the cancer cell line option in the context module, the visual expression of interested lncRNA can be gained in cancer cell line.

LncSEA

LncSEA is a comprehensive database of annotation and enrichment analysis of lncRNA published by Harbin Medical University in China in 2020 (74). The developers integrated lncRNA datasets from more than 20 databases, such as the genome region of the sample from Cistrome (75) and NCBI, which collected lncRNA related to cancer survival from The Cancer Genome Atlas (TCGA) (76) and information on diseases and drugs from databases such as Lnc2Cancer2.0 (77) and LncRNADisease2.0 (46). The relationship between lncRNA–mRNA and lncRNA–miRNA was downloaded from LncACTdb2.0 (78).

LncSEA provides (i) sequence information: including sequence analysis, basic transcript information, conservative, methylation, SNP, coding potential and expression, (ii) molecular function: including subcellular locations, lncRNA–mRNA interaction, lncRNA–miRNA interaction and RNA-binding protein and (iii) clinical relevance: including cancer and normal tissues expression difference, cell lines expression profiles, survival analysis, cancer clinical stage, metastasis, lncRNA-related diseases and drugs.

The developers provided clinical research assistance to users in the LncSEA (Figure 2A), such as disease (each disease collection contains a list of lncRNA associated with the disease), drug (containing two subsets of experimental support and algorithmic prediction), cancer hallmark (including metastasis, apoptosis, prognosis, etc.) and cancer phenotype (database including 40 cancers and 325 phenotypes). In addition, LncSEA provides feature annotations and enrichment analysis capabilities. Oncologists can analyze the differential expression genes of lncRNA in cancers by LncSEA and use the results to study the relationship between lncRNA and cancer or drugs in preparation for the development of new target drugs. On the user interface, users can first explore a certain specific lncRNA and, second, they can click ‘drug’ in subcategory, and then, the drugs on the screen are related to the lncRNA. Third, clicking one of the drugs that have been searched for can link to other lncRNAs even diseases that are related to the drug.

Examples of graphical analysis outputs generated for the LncRNA HOTAIR in cancers using web platforms: (A) Functions (i) and relevant ceRNA network diagram (ii) of HOTAIR was analyzed by LncSEA in 18 kinds of reference data sets, including Accessible chromatin, Cancer hallmark, Cancer phenotype, Cell marker, Disease, Drug, Enhancer, RNA-binding protein, Methylation pattern, MicroRNA, EQTL, smORF, Subcellular localization, Super enhancer, Survival, TF, Exosome and Conservation. (B) The expression differences of HOTAIR in 21 types of cancer and normal tissues was shown in histogram (i) by LncRNASNP2. And the scatter diagram (ii) was shown the binding sites of miRNAs predicted by LncRNASNP2. (C). Lnc2Cancer output an overview of the statistics of lncRNA HOTAIR based on the human map and expression charts of cancer tissues (i). Box plot (ii) was used to compare the expression of HOTAIR between specific cancer and normal samples. (iii) Kaplan–Meier plot showing overall survival in higher (shown in red) and lower (shown in blue) HOTAIR expression groups in of COAD patients. (iv)Violin plots showing lncRNA expressing levels among stage I, II, III and IV COAD samples. D. LncR2Metasta (i) displayed the cancer metastasis events in which lncRNA HOTAIR may involve in and provided supported information such as expression patterns, detection methods and origin research paper. LnCAR (ii) shows the degree of metastasis expression of HOTAIR in different cancers by heat map, where green transition red represents the transition from low expression to high expression.
Figure 2.

Examples of graphical analysis outputs generated for the LncRNA HOTAIR in cancers using web platforms: (A) Functions (i) and relevant ceRNA network diagram (ii) of HOTAIR was analyzed by LncSEA in 18 kinds of reference data sets, including Accessible chromatin, Cancer hallmark, Cancer phenotype, Cell marker, Disease, Drug, Enhancer, RNA-binding protein, Methylation pattern, MicroRNA, EQTL, smORF, Subcellular localization, Super enhancer, Survival, TF, Exosome and Conservation. (B) The expression differences of HOTAIR in 21 types of cancer and normal tissues was shown in histogram (i) by LncRNASNP2. And the scatter diagram (ii) was shown the binding sites of miRNAs predicted by LncRNASNP2. (C). Lnc2Cancer output an overview of the statistics of lncRNA HOTAIR based on the human map and expression charts of cancer tissues (i). Box plot (ii) was used to compare the expression of HOTAIR between specific cancer and normal samples. (iii) Kaplan–Meier plot showing overall survival in higher (shown in red) and lower (shown in blue) HOTAIR expression groups in of COAD patients. (iv)Violin plots showing lncRNA expressing levels among stage I, II, III and IV COAD samples. D. LncR2Metasta (i) displayed the cancer metastasis events in which lncRNA HOTAIR may involve in and provided supported information such as expression patterns, detection methods and origin research paper. LnCAR (ii) shows the degree of metastasis expression of HOTAIR in different cancers by heat map, where green transition red represents the transition from low expression to high expression.

LncSEA is a newly published comprehensive platform providing various lncRNA sets with annotation and enrichment analysis features by submitting lncRNA lists. The lncRNA set enrichment analyses are related to upstream regulators and downstream targets of RNAs. Visual bar and bubble chart of results can be gained friendly.

Specific function analysis

lncRNASNP2

LncRNASNP was first published by China Huazhong University of Science and Technology in 2014 (79) and updated to LncRNASNP2 in 2017 (80). The database provides a web tool that helps users predict the impact of SNPs on the secondary structure of lncRNA. 859 534 CosmicNCVs (Cosmic Non-Coding Variants) were collected from the COSMIC (Catalogue Of Somatic Mutations In Cancer) (81). 71 410 pathogenic variants were found on the lncRNA transcript. Cancer mutations were collected -from the TCGA (82) database and 315 234 mutated lncRNA transcripts were found in 34 cancer types. Developers used algorithms to calculate and collected experimentally supporting data to predict lncRNA-related diseases.

LncRNASNP2 provides (i) sequence information: including sequence analysis, conservative, secondary structure, basic transcript information, SNP and expression, (ii) molecular function: including miRNA–lncRNA interaction prediction and (iii) clinical relevance: including expression difference in different cancers, expression profiles in cancer cell lines, variation and disease-related lncRNAs.

For users to study disease-related information, the ‘expression’ section not only summarizes the expression of lncRNA under different tumors and mutations but also compares the expression specificity of lncRNA in different situations [Figure 2B (i)]. Basic information of miRNA provides the level of expression of each human miRNA in 28 cancers [Figure 2B (ii)]; thus, oncologists can view the classification of lncRNA-related diseases [experimental support or based on TAM (83) (The tool for annotations of human miRNAs) prediction].

LncRNASNP2 aims to provide comprehensive information about lncRNA SNPs and their effects on the lncRNA structure and function, particularly miRNA–lncRNA interaction. Wenyuan Zhao et al. used LncRNASNP2 to predict the targeted miRNAs of LINC02310 to further construct a regulatory network in lung adenocarcinoma (84). The potential effects of SNPs on the generation or destruction of the miRNA binding site were analyzed using lncRNASNP2 (85, 86). Different SNPs in the same allele allowed different interactions with miRNA, as further studies verified the discrepancy, such as luciferase reporter assay. At last, the study suggested A allele of SNP rs140618127 in lncRNALOC146880 may have a protective effect on non-small cell lung cancer (NSCLC) compared with genotype [G] (86).

LncRNA2Target

LncRNA2Target was first released by Harbin Institute of Technology in China in 2014 (87) and updated to LncRNA2Target v2.0 (88) in 2019. Until today, the database contains 152 137 lncRNA–target associations from 1047 papers and 224 data sets.

Developers retrieved literature and datasets from the Pubmed and GEO (47) databases. The target gene of lncRNA was collected from all differential expression genes after knocking down or over-expression of a certain lncRNA or inferred from low-throughput experiments (such as immunoprecipitation assays, RNA pull down assays, luciferase reporter assays RT-qPCR (Real Time Quantitative PCR) and western blot).

LncRNA2Target provides (i) sequence information including sequence analysis and basic transcript information, (ii) molecular function: expression and gene interaction and (iii) clinical relevance: including expression difference, cancer cell lines, survival analysis and disease-related lncRNAs.

In clinical studies, users can directly query the disease status and expression spectrum in cancer cell lines associated with the lncRNA target gene. When users click on the target symbol, experimental evidence supporting lncRNA–target association is displayed, including links to the original literature and partial prognostic analysis.

LncRNA2Target is chiefly aiming to provide lncRNA-to-target genes experimentally verified. In the study of identifying cancer-related lncRNAs, Zihao Liu et al. used LncRNA2Target to obtain target genes of lncRNA and, then, select the most common target genes corresponding to more than five lncRNAs to be the features of lncRNAs. With the cancer-related lncRNAs gained from LncRNADisease database and some methods such as Deep Belief Network applied to the study, potential lncRNAs for 16 kinds of cancers were obtained (89).

LncLocator

Recent studies have shown that the subcellular location of lncRNA provides important information for understanding their complex biological functions. LncLocator, the first web tool for predicting lncRNA subcellular positioning, was released in 2018 by Shanghai Jiao Tong University in China (90) and updated version 2.0 in 2021 (91). Humanly managed and experimentally supported RNA-related subcellular positioning entries are downloaded from the comprehensive database RNALocate (92), and the filtered baseline dataset contains 612 lncRNAs covering five subcellular compartments.

In version 1.0, K-mer (93) and deep structure abstract features are incorporated into the model construction of this study, and a supervised oversampling method called SOS (supervised oversampling) (94) is adopted. To improve prediction performance, LncLocator uses a stacked combination strategy to combine four learning machines, including random forest with features extracted by deep neural networks (RFA), supporting vector machine with features extracted by deep neural networks (SVMA), random forest with raw k-mer features (RFR) and support vector machine with raw k-mer features (SVMR).

In the 2.0 version, the developers pre-constructed a baseline dataset of lncRNA subcellular positioning for 15 cell lines. Then they used natural language models to learn word embedding and input the embedding of these learnings into the convolutional neural network, long short-term memory and multilayer perceptron to classify different cell lines effectively.

In terms of function, LncLocator can analyze lncRNA sequence, calculate and predict five subcellular locations, including nucleus, cytoplasm, ribosome, membrane and exosome. Each item has a 0–1 rating mechanism, and the higher score indicates a greater chance of reaching the cubicle. In the previously constructed benchmark data set test, the prediction accuracy of the LncLocator database was 0.59.

The functions of lncRNAs rely on the subcellular locations. For instance, lncRNAs located in cytoplasm regulate posttranscriptional expression. Practically when researchers determine the molecular mechanism of lncRNAs, LncLocator is applied to predict the locations of lncRNAs. To analyze the biological functions of lncRNA LCAT1 in lung cancer, Juze Yang et al. predicted the location of lncRNA LCAT1 by LncLocator. It was suggested to be located in cytoplast, indicating LCAT1 may regulate protein expression at the posttranscriptional level. Further studies verified that LCAT1 regulate functions via miR-4715-5p (95).

LncTarD

LncTarD was released by Harbin Medical University in 2019 (44), including 33 types of cancer from TCGA, in addition 1862 articles from PubMed.

LncTarD provides (i) sequence information: including sequence analysis, methylation, basic transcript information and expression, (ii) molecular function: including lncRNA–mRNA interaction and lncRNA–protein interaction and (iii) clinical relevance: including metastasis, variation, differential expression between different cancers and normal tissues, lncRNA-mediated drugs and disease-related lncRNAs.

Using the data in LncTarD, users can visually see the relevant lncRNA target regulations of top lncRNA, disease and drug in the ‘Browser’ module. Developers provide users with a visual website to show the regulation of lncRNA target in diseases, including the biological function of the effects from lncRNAs and related drugs. On the visual dynamic chart interface, oncologists can view box charts of differences in lncRNA in various tumors and normal tissues, as well as other disease-related scatter pages or heat maps.

LncTarD is a manually curated database supported by experiments mainly focusing on diseases relevant lncRNAs. Qinchen Lu et al. used the database to validate the expression of lncRNA HOXA-AS2 in hepatocellular carcinoma (HCC) in TCGA, up-regulated in both cell lines and tissues of HCC. Further studies verified that the higher expression of HOXA-AS2 was correlated with poor prognosis positively and stem cell-related functions (96).

Clinical significance analysis and prediction

LncRNADisease

LncRNADisease is a database of lncRNA associated with disease, released by Peking University in China in 2013 (97) and updated to LncRNADisease 2.0 in 2018 (46). LncRNADisease 2.0 contains 195 395 predicted lncRNA–disease associations, 12 207 lncRNA-mRNAs, and 2368 miRNA–lncRNA regulatory relationships. It also integrates lncRNA–disease associations supported by 10 564 experiments and circRNA–disease associations supported by 1004 experiments across four species (46).

In LncRNADisease 2.0, an option provided on the search page that allows users to filter the association of lncRNA with disease through certain experimental methods and assign a confidence score to each entry by integrating experimental and computational evidence. To further explain the functional significance of disease-related lncRNAs, a visual lncRNA–miRNA–mRNA network was constructed.

In general, LncRNADisease provides (i) sequence information: expression, sequence analysis and basic transcript information, (ii) molecular function: subcellular location, gene interaction, lncRNA–protein interaction, lncRNA–mRNA interaction and lncRNA–miRNA interaction and (iii) clinical relevance: lncRNA-related diseases.

Notably, the database provides detailed descriptions of lncRNA-related disease names, dysfunction types, detailed descriptions of lncRNA–disease associations and PMIDs of the corresponding literature. After the update, the database can provide the total associated disease number for lncRNA. To make it easier for users to obtain disease information from external sources, disease names are mapped to disease ontology (DO) (98), medical subject headings (MESH) (99) and each lncRNA–disease association entry includes disease name, category, type, define and alias. Information about the total number of disease-related lncRNAs, forecasting methods, etc., is also available for users.

To obtain more NSCLC relative lncRNAs, LncRNADisease was examined to find that CBR3-AS1, a latent functional lncRNA, is up-regulated expression in NSCLC, indicating CBR3-AS1 may play roles in the occurrence of NSCLC. Further research works were validated that CBR3-AS1 regulated the biofunctions of NSCLC cells such as proliferation, migration and invasion through affecting the dynamic process of β-catenin transport that activated the Wnt signaling pathway (100).

TANRIC

TANRIC is a web tool for analysis and prediction of lncRNA clinical significance (101). It was released by the Department of Bioinformatics and Computational Biology in 2015, and the latest update was in 2019. More than 40 different cancer types and cell lines data in TANRIC are extracted from the following databases, including TCGA, Cancer Cell Line Encyclopedia (102) and another three independent studies (GBM (103), ccRCC and LUAD).

On data processing, TANRIC lets the lncRNA expression as Reads Per Kilobase per Million mapped reads (104) to make users quickly analyze the specified lncRNA on a web page. Using the annotated lncRNA expression data and pre-calculated correlations with clinical and genomic data, which are stored in CouchDB, TANRIC can perform correlation, expression difference and survival analysis by R and output visual results directly.

In general, TANRIC provides (i) sequence information: including sequence analysis, basic transcript information and expression, (ii) molecular function, including signal pathway, gene interaction, mRNA interaction, miRNA interaction and interactions between protein and lncRNA and (iii) clinical relevance, including variation, differential expression between cancers and normal tissues, cancer cell lines expression profiles, analyzed the patient’s survival time, tumor stage and tumor grade. It also provides genomic variation, disease-related lncRNA and evaluates the effects of lncRNA expression on drug sensitivity.

In recent studies, TANRIC has been widely used (312 cited research papers, to date) to find out specific lncRNA with significant differences between cancers and normal controls. The expression status of these lncRNAs has been analyzed by TANRIC. Lu Feng et al. found LINC02487 was downregulated in neck squamous cell carcinoma (105). Yuxing Zhu et al. discovered LINC00968 was decreased in lung adenocarcinoma (106). Based on the analysis, further studies indicated that LINC02487 might act as a tumor suppressor in oral squamous cell carcinoma and LINC00968 may play roles as a diagnostic and prognostic biomarker and therapeutic target in lung adenocarcinoma.

Lnc2Cancer

Lnc2Cancer was first released by Harbin Medical University in 2015 (107) and latest updated to Lnc2Cancer 3.0 (42) in 2020. It aims to provide integrated resource for exploring lncRNA deregulation in various human cancers. Lnc2Cancer 3.0 records 10 303 entries associate between 2659 human lncRNAs or 743 circRNAs and 216 human cancer subtypes thorough review of more than 15 000 published papers (42).

In terms of data sets, developers manually collated literature from the PubMed database on such things as ‘lncRNA cancer’ and ‘long-non-coded RNA cancer’. Lnc2Cancer provides identifiers and links to lncRNA that exist in the Ensembl and RefSeq databases. Finally, a standardized classification scheme, the International Classification of Diseases for Oncology, the 3rd Edition (ICD-O-3), annotates each type of cancer. Lnc2Cancer also provides hyperlinks to other data information, such as lncRNAs in lncRNAdb (38) and NONCODE (14), as well as cancer in the OMIM (108) and COSMIC (109) databases. In the updated version 3.0, new cancer-related lncRNA functions are identified based on uploaded single-cell datasets and implemented by R package Seurat (version 3.1.5).

Lnc2Cancer provides (i) sequence information: including sequence analysis, basic transcript information, methylation, coding ability and expression, (ii) molecular function: including subcellular location, lncRNA–mRNA interaction and lncRNA–miRNA interaction and (iii) clinical relevance: including expression difference, variation, survival analysis, cancer clinical stage, metastasis, disease-related lncRNA and drug resistance.

Lnc2Cancer provides a user-friendly interface to query a range of cancer-related lncRNAs on the ‘RNA-seq web tool’ page. It includes General Information, Differential Expression Analysis (DEA), Box Plotting [Figure 2C (ii)], Stage Plotting, Survival Analysis [Figure 2C (iii)], Similar lncRNAs Identification, Correlation Analysis, Network Construction and Transcription Factor (TF) Motif Prediction. General Information also has a global view of interested lncRNA statistics based on a human map and lncRNA chart based on cancer type [Figure 2C (i)]. Stage Plotting, which generates a violin diagram of the expression of specific lncRNA based on the patient’s pathological stages [Figure 2C (iv)], allows the user to select one of detailed cancer stages. The Correlation Analysis feature provides expression correlation analysis for two interested lncRNAs in cancer.

Zhengwei Du et al. identified lncRNA-meditated transcriptional dysregulation triplets (lncTDTs) in luminal and triple-negative breast cancer basing on TF, gene, lncRNA expression profiles and TF–gene interactions verified by experiment. They collected cancer-related lncRNAs from lnc2cancer, which differentially expressed in these subtypes using t-test. The specific lncRNAs were subsequently analyzed and lncTDTs aiding development of classification and treatment of breast cancer were gained ultimately (110).

Lnc2Catlas

Since lncRNAs are involved in multiple signaling pathways, it is essential to illustrate the molecular network interplay. Lnc2Catlas (111) was released in 2018 by the Department of Biotechnology of the Beijing Institute of Radiology in China. The database contains 247 124 pairs of lncRNA–SNP, ∼2 million lncRNA–protein interactions and 6902 co-expression clusters.

Lnc2Catlas studied the relationship between lncRNA and cancer from 33 cancer expression files in TCGA (82), annotated lncRNA transcripts from GENCODE V22, SNP in lncRNA transcripts from dbSNP and cancer-related proteins from MalaCards, DisGeNET and human protein maps. Lnc2Catlas uses three computational methods to evaluate the relationship between quantitative lncRNA and cancer, including evaluating secondary structural damage (quantitatively assessed with RNAsnp), lncRNA–protein interaction (using Global Score calculations) and co-expression networks (building weighted co-expression networks based on TCGA to WGCNA). Finally, a total of 1038 articles related to lncRNA associated with cancer are compiled to augment the experimentally supporting ability in Lnc2Catlas.

Lnc2Catlas provides (i) sequence information, including sequence analysis, basic transcript information, secondary structure, SNP-structure disruption and expression, (ii) molecular function, including lncRNA–protein interaction and signal pathway and (iii) clinical relevance, including expression difference (between different cancers) and lncRNA-related diseases.

Tao Luo et al. explored the co-expression network of ELMO1-AS1 in Lnc2Catlas, which 3D space showed that it is associated with various diseases and proteins (112). The result implied the significance of ELMO1-AS1 in biology. ELMO1-AS1 may be a biomarker of prognostic and therapy in HCC demonstrated by further studies (112).

Lnc2Catlas provides direct access to all 33 cancers collected with a friendly visualization page. Researchers can query in the SNP-structure disruption section that SNPs of lncRNA are associated with different cancers, and by evaluating the relationship by hitting the relationship, provide the average expression level of lncRNA in 33 cancers in the expression section, show the subgraph of lncRNA in the cancer in the co-expression network section, pull down the selection of cancers of interest in the GO and pathway analysis sections and, finally, provide experimental support literature.

LnCAR

As an open-source of high-throughput sequencing data, lnCAR (lncRNAs from Cancer Arrays) was released in 2019 by the Sun Yat-sen University, China (113). The database lnCAR contains 732 expression spectrum array datasets covering 54 893 samples of 10 cancer types (about 52 300 cases in the sample have more than seven different sets of clinical component information). Out of 107 gene expression microarray platforms, a total of 464 502 lncRNA probes were obtained to represent 26 464 lncRNAs.

In order to unify lnCAR data standards, different algorithms are used for database resources: Affymetrix (114) data sets use RMA algorithms, and for datasets from Agilent, Illumina and other platforms, the original data are graded and normalized using LIMMA packages (115), all of which are standardized at the detection level and converted to log2.

lnCAR provides (i) sequence information: including sequence analysis, basic transcript information, conservation, secondary structure and coding potential and expression, (ii) molecular function: signal pathway and (iii) clinical relevance: including differential expression between cancer and normal tissues, cell lines expression profiles, variation, survival analysis, cancer clinical stage, metastasis, disease-related lncRNA and drug response.

Jinlin Jia et al. analyzed the bioinformatics data from lnCAR and uncovered that lncRNA FAM83A-AS1 was highly expressed in esophageal cell squamous carcinoma (ESCC). Involving 51 ESCC tissues versus adjacent non-tumor tissues, qRT-PCR results also indicated remarkably increased FAM83A-AS1 in ESCC. Further studies were revealed that FAM83A-AS1 acted as tumorigenesis in ESCC and distinguished ESCC with high accuracy (116).

lnCAR provides users with a great deal of information to query clinical data. The database provides a visual heat map of different expressions of results and different types of survival information under different conditions, where ‘differential expression’ includes tumor vs. normal, high grade vs. low grade, high stage vs. low stage, metastasis vs. primary [Figure 2D (ii)], treatment vs. control and other features. And users can explore the relationship between lncRNA and different survival outcomes, such as total survival and recurrence-free survival for different cancer types.

LncR2metasta

Numerous studies have shown that metastasis is a major limiting factor affecting cancer prognosis and a huge obstacle in clinical treatment. LncR2metasta was released by Wuhan University of Science and Technology in China in 2020 (117). The database collected 1283 associations between 304 lncRNAs and 39 tumor metastasis events out of 54 tumor types.

A large amount of data indicates that lncRNA is involved in a variety of cancer metastatic events (CMEs). The database manually collates lncRNAs supported by experiments in various CMEs in existing studies, in addition, conducts literature retrieval through Pubmed for insanity transfer experiments, and other data are collected from databases such as Entrez (118) and Ensembl (119) that are finally presented through web-based databases.

LncR2metasta provides (i) sequence information: including sequence analysis, basic transcript information and expression pattern, (ii) molecular function: including target gene (or pathway) of lncRNA and (iii) clinical relevance: including cancer cell lines expression profiles, CMEs and lncRNA-related diseases.

In clinical studies, research works on lncRNA-CME can help understand the role of lncRNA in metastasis and provide reliable biomarkers for early detection of metastasis and optimal treatment. Oncologists can query lncRNA in the database they are interested in [Figure 2D (i)], or they can look for lncRNA based on events related to tumor metastasis. The database also provides different transfer-related functions of lncRNA, including ‘cancer cell invasion’, ‘cancer cell migration’ and ‘cancer cell product’. Users can also select a special CME to query related lncRNA information by lncRNA symbol, ID (Entrez ID and Ensembl ID) or location.

ncRNA-eQTL

Assessing the effects of SNPs on ncRNA expression using data from multiple cancer types will help to understand how genetic risk all-like genes promote tumor occurrence and cancer development. Non-coding RNA-expression quantitative trait loci (ncRNA-eQTL) is a database published by Huazhong Agricultural University in China in 2019 (120). There are more than 300 samples of 12 cancer types in the database.

In terms of data sources, SNP parting results, lncRNA and miRNA expression, clinical information of samples was obtained from TCGA (82), and GWAS analysis results were obtained from the GWAS catalog. In addition, the developers linked ncRNA-eQTL to known GWAS loci and patient lifetimes, and the developers also identified thousands of GWAS-related and survival-related eQTLs.

ncRNA-eQTL provides (i) sequence information, including sequence analysis, basic transcript information, expression and SNP, (ii) molecular function: miRNA interaction and (iii) clinical relevance: including differential expression between cancer and normal tissues, variation, survival analysis, cancer clinical stage, metastasis and disease-related lncRNA.

On the clinical side, to prioritize promising ncRNA-eQTL, developers link the eQTL of the database to clinical data in TCGA patients, allowing users to query eQTL that may be relevant to the total lifetime. These survival-related eQTLs and associated ncRNAs can be applied to biomarkers for prediction and prognosis. Users can also query that lncRNA-eQTL overlaps with known disease/symptom-related bits.

Maija Suvanto et al. explored SNPs in lncRNA regions where genetic variants are associated with breast cancer risk. Datasets from Genotype-Tissue Expression GETx and Molecular Taxonomy of Breast Cancer International Consortium were brought into eQTL analysis and further verification by ncRNA-eQTL online. SNPs uc.184 and uc.313 were presumed associated with breast cancer risk, providing candidate loci for further study on molecular mechanisms in breast cancer (121).

Summary and discussion

Accumulating research works gradually uncovered the biogenesis in which the widely expressed lncRNAs are involved. These processes are closely related to their subcellular localizations and specific functions. Depending on the interactions with gene, RNAs or proteins, lncRNAs can modulate transcriptional interference, chromatin remodeling, post-translational regulation and protein modification, and further interfere with the signaling pathways. Ultimately, aberrant functions step over the normal biological process to physiopathological contexts, such as carcinogenesis and progressing.

To help unravel the mechanisms of lncRNAs functions in cancers, bioinformatics databases to analyze lncRNAs are arising. To study the roles and mechanisms of lncRNAs in relevant tumors, oncologists first screen for specific lncRNAs. Some databases based on computer language provide visualized tumor-related lncRNA tables or heat maps, which can help obtain the target lncRNA. When verifying the function of the target lncRNA for further study, some databases provide co-expression nets such as interacting proteins, mRNAs and miRNAs. And in clinical correlation analysis, some databases provide prognosis analysis and disease correlation analysis for visualization. On these grounds, they can design in vivo/in vitro experiments or expand the clinical sample size for verification (Figure 3). Unlike previous databases that require users to know programming language and download plenty of data resources to analyze, some newly established web tools are more convenient and efficient for users. Although there are some good reviews, the databases included are relatively early (122) or minor attention to disease-related databases (123).

Flow chart of lncRNA in silico analysis and validation in oncology research. This flow chart indicates the general process of lncRNA analysis and validation in cancer research, the purpose of which are searching for characteristic markers of diagnosis/prognosis or potential molecular targets for treatment. Both data from open-source datasets or high-throughput sequencing results of clinical samples can be utilized. The specific lncRNA can be screened out by analysis of expressing differences, co-expressing network or cis/trans regulating of oncogenes or tumor suppressors. To clarify its roles, researchers can use web tools (the red dotted line area, which is described detailed in Table 2) to analysis the specific functions, including sequence general information, molecular function, and clinic relevance. It is essential to verify the results of the functional analysis by molecular experiments and animal models. For the establishment of diagnostic/prognostic markers, in addition to validation, lncRNAs also need to be compared with the existing clinical criteria to evaluation.
Figure 3.

Flow chart of lncRNA in silico analysis and validation in oncology research. This flow chart indicates the general process of lncRNA analysis and validation in cancer research, the purpose of which are searching for characteristic markers of diagnosis/prognosis or potential molecular targets for treatment. Both data from open-source datasets or high-throughput sequencing results of clinical samples can be utilized. The specific lncRNA can be screened out by analysis of expressing differences, co-expressing network or cis/trans regulating of oncogenes or tumor suppressors. To clarify its roles, researchers can use web tools (the red dotted line area, which is described detailed in Table 2) to analysis the specific functions, including sequence general information, molecular function, and clinic relevance. It is essential to verify the results of the functional analysis by molecular experiments and animal models. For the establishment of diagnostic/prognostic markers, in addition to validation, lncRNAs also need to be compared with the existing clinical criteria to evaluation.

We also illustrated examples of graphical analysis outputs generated for the lncRNA HOTAIR in cancers using web platforms, hoping that worldwide oncology researchers can use these web tools in exploring the unannotated lncRNAs or known lncRNA with novel functions. These tissue-specific or condition-specific expressing patterns of lncRNAs suggest that they are potential biomarkers in clinical diagnosis and prognosis analysis, meaning these specific roles that they played in the cancer cell fate provide a rationale to target them in potential therapy.

Acknowledgements

We sincerely appreciate all the contributors to databases and web platforms for cancer research. This work was supported by the National Natural Science Foundation of China (82003001) and Foundation of Xinglin Scholar Research Premotion Project of Chengdu University of TCM (ZRYY1920 and CGZH2018005).

Funding

His work was supported by the National Natural Science Foundation of China (82003001) and Foundation of Xinglin Scholar Research Premotion Project of Chengdu University of TCM (ZRYY1920 and CGZH2018005).

Conflict of interest

None declared.

References

1.

Iyer
M.K.
,
Niknafs
Y.S.
,
Malik
R.
et al.  (
2015
)
The landscape of long noncoding RNAs in the human transcriptome
.
Nat. Genet.
,
47
,
199
208
.

2.

Wang
K.C.
and
Chang
H.Y.
(
2011
)
Molecular mechanisms of long noncoding RNAs
.
Mol. Cell
,
43
,
904
914
.

3.

Fatica
A.
and
Bozzoni
I.
(
2014
)
Long non-coding RNAs: new players in cell differentiation and development
.
Nat. Rev. Genet.
,
15
,
7
21
.

4.

Taniue
K.
and
Akimitsu
N.
(
2021
)
The functions and unique features of LncRNAs in cancer development and tumorigenesis
.
Int. J. Mol. Sci.
,
22
, 632.

5.

Huang
Y.
,
Guo
Q.
,
Ding
X.P.
et al.  (
2020
)
Mechanism of long noncoding RNAs as transcriptional regulators in cancer
.
RNA Biol.
,
17
,
1680
1692
.

6.

Guo
F.
,
Li
L.
,
Yang
W.
et al.  (
2021
)
Long noncoding RNA: a resident staff of genomic instability regulation in tumorigenesis
.
Cancer Lett.
,
503
,
103
109
.

7.

Hu
Z.
,
Mi
S.
,
Zhao
T.
et al.  (
2020
)
BGL3 lncRNA mediates retention of the BRCA1/BARD1 complex at DNA damage sites
.
EMBO J.
,
39
, e104133.

8.

Chu
W.
,
Zhang
X.
,
Qi
L.
et al.  (
2020
)
The EZH2-PHACTR2-AS1-ribosome axis induces genomic instability and promotes growth and metastasis in breast cancer
.
Cancer Res.
,
80
,
2737
2750
.

9.

Lu
S.
,
Zhang
J.
,
Lian
X.
et al.  (
2019
)
A hidden human proteome encoded by ‘non-coding’ genes
.
Nucleic Acids Res.
,
47
,
8111
8125
.

10.

Hu
Q.
,
Ye
Y.
,
Chan
L.C.
et al.  (
2019
)
Oncogenic lncRNA downregulates cancer cell antigen presentation and intrinsic tumor suppression
.
Nat. Immunol.
,
20
,
835
851
.

11.

Li
D.
,
Tong
Q.
,
Lian
Y.
et al.  (
2021
)
Inhibition of lncRNA KCNQ1OT1 improves apoptosis and chemotherapy drug response in small cell lung cancer by TGF-β1 mediated EMT
.
Cancer Res. Treat
. 10.4143/crt.2020.1208

12.

Chen
J.
,
Liu
A.
,
Wang
Z.
et al.  (
2020
)
LINC00173.v1 promotes angiogenesis and progression of lung squamous cell carcinoma by sponging miR-511-5p to regulate VEGFA expression
.
Mol. Cancer
,
19
, 98.

13.

Xu
W.W.
,
Zheng
C.C.
,
Zuo
Q.
et al.  (
2021
)
Genome-wide identification of key regulatory lncRNAs in esophageal cancer metastasis
.
Signal Transduction Targeted Ther.
,
6
, 88.

14.

Zhao
L.
,
Wang
J.
,
Li
Y.
et al.  (
2021
)
NONCODEV6: an updated database dedicated to long non-coding RNA annotation in both animals and plants
.
Nucleic Acids Res.
,
49
,
D165
D171
.

15.

Wang
C.
,
Wang
J.
,
Shen
X.
et al.  (
2021
)
LncRNA SPOCD1-AS from ovarian cancer extracellular vesicles remodels mesothelial cells to promote peritoneal metastasis via interacting with G3BP1
.
J. Exp. Clin. Cancer Res.
,
40
, 101.

16.

Athie
A.
,
Marchese
F.P.
,
González
J.
et al.  (
2020
)
Analysis of copy number alterations reveals the lncRNA ALAL-1 as a regulator of lung cancer immune evasion
.
J. Cell Biol.
,
219
, e201908078.

17.

El-Fattah
A.A.A.
,
Sadik
N.A.H.
,
Shaker
O.G.
et al.  (
2021
)
Serum long non-coding RNAs PVT1, HOTAIR, and NEAT1 as potential biomarkers in Egyptian women with breast cancer
.
Biomolecules
,
11
, 301.

18.

Mer
A.S.
,
Lindberg
J.
,
Nilsson
C.
et al.  (
2018
)
Expression levels of long non-coding RNAs are prognostic for AML outcome
.
J. Hematol. Oncol.
,
11
, 52.

19.

Matsui
M.
and
Corey
D.R.
(
2017
)
Non-coding RNAs as drug targets
.
Nat. Rev. Drug. Discov.
,
16
,
167
179
.

20.

Jenjaroenpun
P.
,
Wongsurawat
T.
,
Pereira
R.
et al.  (
2018
)
Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D
.
Nucleic Acids Res.
,
46
, e38.

21.

Arnes
L.
,
Liu
Z.
,
Wang
J.
et al.  (
2019
)
Comprehensive characterisation of compartment-specific long non-coding RNAs associated with pancreatic ductal adenocarcinoma
.
Gut
,
68
,
499
511
.

22.

Bhartiya
D.
,
Pal
K.
,
Ghosh
S.
et al.  (
2013
)
lncRNome: a comprehensive knowledgebase of human long noncoding RNAs
.
Database
,
2013
, bat034.

23.

Harrow
J.
,
Denoeud
F.
,
Frankish
A.
et al.  (
2006
)
GENCODE: producing a reference annotation for ENCODE
.
Genome Biol.
,
7
,
S4.1
S4.9
.

24.

Kent
W.J.
,
Sugnet
C.W.
,
Furey
T.S.
et al.  (
2002
)
The human genome browser at UCSC
.
Genome Res.
,
12
,
996
1006
.

25.

Cava
C.
,
Bertoli
G.
and
Castiglioni
I.
(
2019
)
Portrait of tissue-specific coexpression networks of noncoding RNAs (miRNA and lncRNA) and mRNAs in normal tissues
.
Comput. Math. Methods Med.
,
2019
, 9029351.

26.

Luo
K.
,
Geng
J.
,
Zhang
Q.
et al.  (
2019
)
LncRNA CASC9 interacts with CPSF3 to regulate TGF-β signaling in colorectal cancer
.
J. Exp. Clin. Cancer Res.
,
38
, 249.

27.

Volders
P.-J.
,
Helsens
K.
,
Wang
X.
et al.  (
2013
)
LNCipedia: a database for annotated human lncRNA transcript sequences and structures
.
Nucleic Acids Res.
,
41
,
D246
D251
.

28.

Volders
P.-J.
,
Anckaert
J.
,
Verheggen
K.
et al.  (
2019
)
LNCipedia 5: towards a reference set of human long non-coding RNAs
.
Nucleic Acids Res.
,
47
,
D135
D139
.

29.

Amaral
P.P.
,
Clark
M.B.
,
Gascoigne
D.K.
et al.  (
2011
)
lncRNAdb: a reference database for long noncoding RNAs
.
Nucleic Acids Res.
,
39
,
D146
D151
.

30.

Zerbino
D.R.
,
Achuthan
P.
,
Akanni
W.
et al.  (
2018
)
Ensembl 2018
.
Nucleic Acids Res.
,
46
,
D754
D761
.

31.

Derrien
T.
,
Johnson
R.
,
Bussotti
G.
et al.  (
2012
)
The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression
.
Genome Res.
,
22
,
1775
1789
.

32.

Wang
X.
and
El Naqa
I.M.
(
2008
)
Prediction of both conserved and nonconserved microRNA targets in animals
.
Bioinformatics
,
24
,
325
332
.

33.

Martens
L.
,
Hermjakob
H.
,
Jones
P.
et al.  (
2005
)
PRIDE: the proteomics identifications database
.
Proteomics
,
5
,
3537
3545
.

34.

Cedro-Tanda
A.
,
Rios-Romero
M.
,
Romero-Cordoba
S.
et al.  (
2020
)
A lncRNA landscape in breast cancer reveals a potential role for AC009283.1 in proliferation and apoptosis in HER2-enriched subtype
.
Sci. Rep.
,
10
, 13146.

35.

Liu
C.
,
Bai
B.
,
Skogerbø
G.
et al.  (
2005
)
NONCODE: an integrated knowledge database of non-coding RNAs
.
Nucleic Acids Res.
,
33
,
D112
D115
.

36.

Howe
K.L.
,
Contreras-Moreira
B.
,
De Silva
N.
et al.  (
2020
)
Ensembl Genomes 2020-enabling non-vertebrate genomic research
.
Nucleic Acids Res.
,
48
,
D689
D695
.

37.

Haft
D.H.
,
DiCuccio
M.
,
Badretdin
A.
et al.  (
2018
)
RefSeq: an update on prokaryotic genome annotation and curation
.
Nucleic Acids Res.
,
46
,
D851
D860
.

38.

Quek
X.C.
,
Thomson
D.W.
,
Maag
J.L.V.
et al.  (
2015
)
lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs
.
Nucleic Acids Res.
,
43
,
D168
D173
.

39.

Szcześniak
M.W.
,
Bryzghalov
O.
,
Ciomborowska-Basheer
J.
et al.  (
2019
)
CANTATAdb 2.0: expanding the collection of plant long noncoding RNAs
.
Methods Mol. Biol.
,
1933
,
415
429
.

40.

Sale
M.M.
,
Mychaleckyj
J.C.
and
Chen
W.M.
(
2009
)
Planning and executing a genome wide association study (GWAS)
.
Methods Mol. Biol.
,
590
,
403
418
.

41.

Lv
D.
,
Xu
K.
,
Jin
X.
et al.  (
2020
)
LncSpA: LncRNA spatial atlas of expression across normal and cancer tissues
.
Cancer Res.
,
80
,
2067
2071
.

42.

Gao
Y.
,
Shang
S.
,
Guo
S.
et al.  (
2020
)
Lnc2Cancer 3.0: an updated resource for experimentally supported lncRNA/circRNA cancer associations and web tools based on RNA-seq and scRNA-seq data
.
Nucleic Acids Res.
,
49
,
D1251
D1258
.

43.

Ma
L.
,
Li
A.
,
Zou
D.
et al.  (
2015
)
LncRNAWiki: harnessing community knowledge in collaborative curation of human long non-coding RNAs
.
Nucleic Acids Res.
,
43
,
D187
D192
.

44.

Zhao
H.
,
Shi
J.
,
Zhang
Y.
et al.  (
2020
)
LncTarD: a manually-curated database of experimentally-supported functional lncRNA-target regulations in human diseases
.
Nucleic Acids Res.
,
48
,
D118
D126
.

45.

Cui
T.
,
Zhang
L.
,
Huang
Y.
et al.  (
2018
)
MNDR v2.0: an updated resource of ncRNA-disease associations in mammals
.
Nucleic Acids Res.
,
46
,
D371
D374
.

46.

Bao
Z.
,
Yang
Z.
,
Huang
Z.
et al.  (
2019
)
LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases
.
Nucleic Acids Res.
,
47
,
D1034
D1037
.

47.

Barrett
T.
,
Wilhite
S.E.
,
Ledoux
P.
et al.  (
2013
)
NCBI GEO: archive for functional genomics data sets—update
.
Nucleic Acids Res.
,
41
,
D991
D995
.

48.

Yeh
C.C.
,
Luo
J.L.
,
Nhut Phan
N.
et al.  (
2018
)
Different effects of long noncoding RNA NDRG1-OT1 fragments on NDRG1 transcription in breast cancer cells under hypoxia
.
RNA Biol.
,
15
,
1487
1498
.

49.

Ma
L.
,
Cao
J.
,
Liu
L.
et al.  (
2019
)
LncBook: a curated knowledgebase of human long non-coding RNAs
.
Nucleic Acids Res.
,
47
,
D128
D134
.

50.

Fang
S.
,
Zhang
L.
,
Guo
J.
et al.  (
2018
)
NONCODEV5: a comprehensive annotation database for long non-coding RNAs
.
Nucleic Acids Res.
,
46
,
D308
D314
.

51.

Volders
P.J.
,
Verheggen
K.
,
Menschaert
G.
et al.  (
2015
)
An update on LNCipedia: a database for annotated human lncRNA sequences
.
Nucleic Acids Res.
,
43
,
4363
4364
.

52.

Wang
L.
,
Park
H.J.
,
Dasari
S.
et al.  (
2013
)
CPAT: coding-potential assessment tool using an alignment-free logistic regression model
.
Nucleic Acids Res.
,
41
, e74.

53.

Li
A.
,
Zhang
J.
and
Zhou
Z.
(
2014
)
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme
.
BMC Bioinform.
,
15
, 311.

54.

Lewis
B.P.
,
Burge
C.B.
and
Bartel
D.P.
(
2005
)
Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets
.
Cell
,
120
,
15
20
.

55.

Betel
D.
,
Wilson
M.
,
Gabow
A.
et al.  (
2008
)
The microRNA.org resource: targets and expression
.
Nucleic Acids Res.
,
36
,
D149
D153
.

56.

Uhlén
M.
,
Fagerberg
L.
,
Hallström
B.M.
et al.  (
2015
)
Proteomics. Tissue-based map of the human proteome
.
Science
,
347
, 1260419.

57.

Bolger
A.M.
,
Lohse
M.
and
Usadel
B.
(
2014
)
Trimmomatic: a flexible trimmer for Illumina sequence data
.
Bioinformatics
,
30
,
2114
2120
.

58.

Pertea
M.
,
Kim
D.
,
Pertea
G.M.
et al.  (
2016
)
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
.
Nat. Protoc.
,
11
,
1650
1667
.

59.

Trapnell
C.
,
Roberts
A.
,
Goff
L.
et al.  (
2012
)
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks
.
Nat. Protoc.
,
7
,
562
578
.

60.

Qiao
K.
,
Ning
S.
,
Wan
L.
et al.  (
2019
)
LINC00673 is activated by YY1 and promotes the proliferation of breast cancer cells via the miR-515-5p/MARK4/Hippo signaling pathway
.
J. Exp. Clin. Cancer Res.
,
38
, 418.

61.

Hou
M.
,
Tang
X.
,
Tian
F.
et al.  (
2016
)
AnnoLnc: a web server for systematically annotating novel human lncRNAs
.
BMC Genomics
,
17
, 931.

62.

Ke
L.
,
Yang
D.C.
,
Wang
Y.
et al.  (
2020
)
AnnoLnc2: the one-stop portal to systematically annotate novel lncRNAs for human and mouse
.
Nucleic Acids Res.
,
48
,
W230
W238
.

63.

Wilks
C.
,
Cline
M.S.
,
Weiler
E.
et al.  (
2014
)
The Cancer Genomics Hub (CGHub): overcoming cancer through the power of torrential data
.
Database (Oxford)
,
2014
, bau093.

64.

Bonilauri
B.
and
Dallagiovanna
B.
(
2020
)
Long non-coding RNAs are differentially expressed after different exercise training programs
.
Front. Physiol.
,
11
, 567614.

65.

Gao
L.
,
Nie
X.
,
Zhang
W.
et al.  (
2020
)
Identification of long noncoding RNA RP11-89K21.1 and RP11-357H14.17 as prognostic signature of endometrial carcinoma via integrated bioinformatics analysis
.
Cancer Cell Int.
,
20
, 268.

66.

Lan
Z.
,
Yao
X.
,
Sun
K.
et al.  (
2020
)
The interaction between lncRNA SNHG6 and hnRNPA1 contributes to the growth of colorectal cancer by enhancing aerobic glycolysis through the regulation of alternative splicing of PKM
.
Front. Oncol.
,
10
, 363.

67.

Liu
M.
,
Zhong
J.
,
Zeng
Z.
et al.  (
2019
)
Hypoxia-induced feedback of HIF-1alpha and lncRNA-CF129 contributes to pancreatic cancer progression through stabilization of p53 protein
.
Theranostics
,
9
,
4795
4810
.

68.

Li
Z.
,
Liu
L.
,
Jiang
S.
et al.  (
2020
)
LncExpDB: an expression database of human long non-coding RNAs
.
Nucleic Acids Res.
,
49
,
D962
D968
.

69.

Jiang
S.
,
Cheng
S.J.
,
Ren
L.C.
et al.  (
2019
)
An expanded landscape of human long noncoding RNA
.
Nucleic Acids Res.
,
47
,
7842
7856
.

70.

Harrow
J.
,
Frankish
A.
,
Gonzalez
J.M.
et al.  (
2012
)
GENCODE: the reference human genome annotation for the ENCODE project
.
Genome Res.
,
22
,
1760
1774
.

71.

Pertea
M.
,
Shumate
A.
,
Pertea
G.
et al.  (
2018
)
CHESS: a new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise
.
Genome Biol.
,
19
, 208.

72.

Hon
C.C.
,
Ramilowski
J.A.
,
Harshbarger
J.
et al.  (
2017
)
An atlas of human long non-coding RNAs with accurate 5ʹ ends
.
Nature
,
543
,
199
204
.

73.

Kang
Y.J.
,
Yang
D.C.
,
Kong
L.
et al.  (
2017
)
CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features
.
Nucleic Acids Res.
,
45
,
W12
W16
.

74.

Chen
J.
,
Zhang
J.
,
Gao
Y.
et al.  (
2021
)
LncSEA: a platform for long non-coding RNA related sets and enrichment analysis
.
Nucleic Acids Res.
,
49
,
D969
D980
.

75.

Mei
S.
,
Qin
Q.
,
Wu
Q.
et al.  (
2017
)
Cistrome data browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse
.
Nucleic Acids Res.
,
45
,
D658
D662
.

76.

Corces
M.R.
,
Granja
J.M.
,
Shams
S.
et al.  (
2018
)
The chromatin accessibility landscape of primary human cancers
.
Science
,
362
, 6413.

77.

Gao
Y.
,
Wang
P.
,
Wang
Y.
et al.  (
2019
)
Lnc2Cancer v2.0: updated database of experimentally supported long non-coding RNAs in human cancers
.
Nucleic Acids Res.
,
47
,
D1028
D1033
.

78.

Wang
P.
,
Li
X.
,
Gao
Y.
et al.  (
2019
)
LncACTdb 2.0: an updated database of experimentally supported ceRNA interactions curated from low- and high-throughput experiments
.
Nucleic Acids Res.
,
47
,
D121
D127
.

79.

Gong
J.
,
Liu
W.
,
Zhang
J.
et al.  (
2015
)
lncRNASNP: a database of SNPs in lncRNAs and their potential functions in human and mouse
.
Nucleic Acids Res.
,
43
,
D181
D186
.

80.

Miao
Y.R.
,
Liu
W.
,
Zhang
Q.
et al.  (
2018
)
lncRNASNP2: an updated database of functional SNPs and mutations in human and mouse lncRNAs
.
Nucleic Acids Res.
,
46
,
D276
D280
.

81.

Forbes
S.A.
,
Bindal
N.
,
Bamford
S.
et al.  (
2011
)
COSMIC: mining complete cancer genomes in the catalogue of somatic mutations in cancer
.
Nucleic Acids Res.
,
39
,
D945
D950
.

82.

Tomczak
K.
,
Czerwińska
P.
and
Wiznerowicz
M.
(
2015
)
The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge
.
Contemp. Oncol. (Pozn)
,
19
,
A68
A77
.

83.

Lu
M.
,
Shi
B.
,
Wang
J.
et al.  (
2010
)
TAM: a method for enrichment and depletion analysis of a microRNA category in a list of microRNAs
.
BMC Bioinform.
,
11
, 419.

84.

Zhao
W.
,
Wang
J.
,
Luo
Q.
et al.  (
2020
)
Identification of LINC02310 as an enhancer in lung adenocarcinoma and investigation of its regulatory network via comprehensive analyses
.
BMC Med. Genomics
,
13
, 185.

85.

Abdi
E.
,
Latifi-Navid
S.
,
Zahri
S.
et al.  (
2020
)
SNP-SNP interactions of oncogenic long non-coding RNAs HOTAIR and HOTTIP on gastric cancer susceptibility
.
Sci. Rep.
,
10
, 16763.

86.

Feng
T.
,
Feng
N.
,
Zhu
T.
et al.  (
2020
)
A SNP-mediated lncRNA (LOC146880) and microRNA (miR-539-5p) interaction and its potential impact on the NSCLC risk
.
J. Exp. Clin. Cancer Res.
,
39
, 157.

87.

Jiang
Q.
,
Wang
J.
,
Wu
X.
et al.  (
2015
)
LncRNA2Target: a database for differentially expressed genes after lncRNA knockdown or overexpression
.
Nucleic Acids Res.
,
43
,
D193
D196
.

88.

Cheng
L.
,
Wang
P.
,
Tian
R.
et al.  (
2019
)
LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse
.
Nucleic Acids Res.
,
47
,
D140
D144
.

89.

Liu
Z.
,
Zhang
Y.
,
Han
X.
et al.  (
2020
)
Identifying cancer-related lncRNAs based on a convolutional neural network
.
Front. Cell Dev. Biol.
,
8
, 637.

90.

Cao
Z.
,
Pan
X.
,
Yang
Y.
et al.  (
2018
)
The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier
.
Bioinformatics
,
34
,
2185
2194
.

91.

Lin
Y.
,
Pan
X.
and
Shen
H.B.
(
2021
)
lncLocator 2.0: a cell-line-specific subcellular localization predictor for long non-coding RNAs with interpretable deep learning
.
Bioinformatics
, btab127. doi:

92.

Zhou
H.
,
Yang
Y.
and
Shen
H.B.
(
2017
)
Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features
.
Bioinformatics
,
33
,
843
853
.

93.

Park
K.J.
and
Kanehisa
M.
(
2003
)
Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs
.
Bioinformatics
,
19
,
1656
1663
.

94.

Hu
J.
,
He
X.
,
Yu
D.J.
et al.  (
2014
)
A new supervised over-sampling algorithm with application to protein-nucleotide binding residue prediction
.
PLoS One
,
9
, e107676.

95.

Yang
J.
,
Qiu
Q.
,
Qian
X.
et al.  (
2019
)
Long noncoding RNA LCAT1 functions as a ceRNA to regulate RAC1 function by sponging miR-4715-5p in lung cancer
.
Mol. Cancer
,
18
, 171.

96.

Lu
Q.
,
Gao
J.
,
Tang
S.
et al.  (
2020
)
Integrated RNA sequencing and single-cell mass cytometry reveal a novel role of LncRNA HOXA-AS2 in tumorigenesis and stemness of hepatocellular carcinoma
.
Onco. Targets Ther.
,
13
,
10901
10916
.

97.

Chen
G.
,
Wang
Z.
,
Wang
D.
et al.  (
2013
)
LncRNADisease: a database for long-non-coding RNA-associated diseases
.
Nucleic Acids Res.
,
41
,
D983
D986
.

98.

Kibbe
W.A.
,
Arze
C.
,
Felix
V.
et al.  (
2015
)
Disease ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data
.
Nucleic Acids Res.
,
43
,
D1071
D1078
.

99.

Bhattacharya
S.
,
Ha-Thuc
V.
and
Srinivasan
P.
(
2011
)
MeSH: a window into full text for document summarization
.
Bioinformatics
,
27
,
i120
i128
.

100.

Hou
M.
,
Wu
N.
and
Yao
L.
(
2021
)
LncRNA CBR3-AS1 potentiates Wnt/beta-catenin signaling to regulate lung adenocarcinoma cells proliferation, migration and invasion
.
Cancer Cell Int.
,
21
, 36.

101.

Li
J.
,
Han
L.
,
Roebuck
P.
et al.  (
2015
)
TANRIC: an interactive open platform to explore the function of lncRNAs in cancer
.
Cancer Res.
,
75
,
3728
3737
.

102.

Barretina
J.
,
Caponigro
G.
,
Stransky
N.
et al.  (
2012
)
The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity
.
Nature
,
483
,
603
607
.

103.

Wirsching
H.G.
,
Galanis
E.
and
Weller
M.
(
2016
)
Glioblastoma
.
Handb. Clin. Neurol.
,
134
,
381
397
.

104.

Mortazavi
A.
,
Williams
B.A.
,
McCue
K.
et al.  (
2008
)
Mapping and quantifying mammalian transcriptomes by RNA-Seq
.
Nat. Methods
,
5
,
621
628
.

105.

Feng
L.
,
Zhang
J.
,
Sun
M.
et al.  (
2020
)
Tumor suppressor LINC02487 inhibits oral squamous cell carcinoma cell migration and invasion through the USP17-SNAI1 axis
.
Front. Oncol.
,
10
, 559808.

106.

Zhu
Y.
,
Bo
H.
,
Chen
Z.
, et al.  (
2020
)
LINC00968 can inhibit the progression of lung adenocarcinoma through the miR-21-5p/SMAD7 signal axis
.
Aging (Albany NY)
,
12
,
21904
21922
.

107.

Ning
S.
,
Zhang
J.
,
Wang
P.
et al.  (
2016
)
Lnc2Cancer: a manually curated database of experimentally supported lncRNAs associated with various human cancers
.
Nucleic Acids Res.
,
44
,
D980
D985
.

108.

Amberger
J.S.
and
Hamosh
A.
(
2017
)
Searching Online Mendelian Inheritance in Man (OMIM): a knowledgebase of human genes and genetic phenotypes
.
Curr. Protoc. Bioinf.
,
58
,
1.2.1
1.2.12
.

109.

Forbes
S.A.
,
Tang
G.
,
Bindal
N.
et al.  (
2010
)
COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer
.
Nucleic Acids Res.
,
38
,
D652
D657
.

110.

Du
Z.
,
Gao
W.
,
Sun
J.
et al.  (
2019
)
Identification of long noncoding RNAmediated transcriptional dysregulation triplets reveals global patterns and prognostic biomarkers for ER+/PR+, HER2 and triple negative breast cancer
.
Int. J. Mol. Med.
,
44
,
1015
1025
.

111.

Ren
C.
,
An
G.
,
Zhao
C.
et al.  (
2018
)
Lnc2Catlas: an atlas of long noncoding RNAs associated with risk of cancers
.
Sci. Rep.
,
8
, 1909.

112.

Luo
T.
,
Chen
M.
,
Zhao
Y.
et al.  (
2019
)
Macrophage-associated lncRNA ELMO1-AS1: a novel therapeutic target and prognostic biomarker for hepatocellular carcinoma
.
Onco. Targets Ther.
,
12
,
6203
6216
.

113.

Zheng
Y.
,
Xu
Q.
,
Liu
M.
et al.  (
2019
)
lnCAR: a comprehensive resource for lncRNAs from cancer arrays
.
Cancer Res.
,
79
,
2076
2083
.

114.

Lockstone
H.E.
(
2011
)
Exon array data analysis using Affymetrix power tools and R statistical software
.
Brief. Bioinf.
,
12
,
634
644
.

115.

Ritchie
M.E.
,
Phipson
B.
,
Wu
D.
et al.  (
2015
)
Limma powers differential expression analyses for RNA-sequencing and microarray studies
.
Nucleic Acids Res.
,
43
, e47.

116.

Jia
J.
,
Li
H.
,
Chu
J.
et al.  (
2021
)
LncRNA FAM83A-AS1 promotes ESCC progression by regulating miR-214/CDC25B axis
.
J. Cancer
,
12
,
1200
1211
.

117.

Zhang
S.
,
He
X.
,
Zhang
R.
et al.  (
2020
)
LncR2metasta: a manually curated database for experimentally supported lncRNAs during various cancer metastatic events
.
Brief. Bioinf
,
22
, bbaa178.

118.

Baxevanis
A.D.
(
2006
)
Searching the NCBI databases using Entrez
.
Curr. Protoc. Bioinf.
Chapter 1, Unit 1.3.

119.

Yates
A.D.
,
Achuthan
P.
,
Akanni
W.
et al.  (
2020
)
Ensembl 2020
.
Nucleic Acids Res.
,
48
,
D682
D688
.

120.

Li
J.
,
Xue
Y.
,
Amin
M.T.
et al.  (
2020
)
ncRNA-eQTL: a database to systematically evaluate the effects of SNPs on non-coding RNA expression across cancer types
.
Nucleic Acids Res.
,
48
,
D956
D963
.

121.

Suvanto
M.
,
Beesley
J.
,
Blomqvist
C.
et al.  (
2020
)
SNPs in lncRNA regions and breast cancer risk
.
Front Genet.
,
11
, 550.

122.

Iwakiri
J.
,
Hamada
M.
and
Asai
K.
(
2016
)
Bioinformatics tools for lncRNA research
.
Biochim. Biophys. Acta
,
1859
,
23
30
.

123.

Pinkney
H.R.
,
Wright
B.M.
and
Diermeier
S.D.
(
2020
)
The lncRNA toolkit: databases and in silico tools for lncRNA analysis
.
Noncoding RNA
,
6
, 49.

Author notes

Shixing Gu and Guangjie Zhang contributed equally to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.