CircleBase V2

An Integrated Platform for eccDNA Annotation Across Cancers and Species. Also see homepage

Scoring system for human

Dependencies

bedtools 2.0 or higher doc
python 3.7 www.python.org
numpy www.numpy.org
scipy www.scipy.org
other common packages: multiprocessing and argparse

tips: Anaconda is always a good choice to install the dependencies.

Input files: bed files for the six regulatory categories and eccDNAs

Chromatin_access.bed download
Chromatin_interaction.bed download
Epigenetic_regulation.bed download
Genetic_variant.bed download
Regulatory_elements.bed download
Targeting_genes.bed download
eccDNA_core.hg19.bed download

Supplementary files: chromosome-specific density of regulatory elements

stat.Chromatin_access.bed download
stat.Chromatin_interaction.bed download
stat.Epigenetic_regulation.bed download
stat.Genetic_variant.bed download
stat.Regulatory_elements.bed download
stat.Targeting_genes.bed download

How to run

Go to the scoring system/human directory and set up all the dependencies
Download all the input and supplementary files listed above and decompress them
Run the run.sh shell script

Output

hits.stat.* files are annotated hits (records) count for each eccDNA in four regulatory categories. The last field is the count number.
*.score files include score for each eccDNA corresponding Gaussian mode in four regulatory categories. Here are the fields:

eccDNA id.
Chromosome to which the eccDNA belongs.
Hits number for the eccDNA.
Hits number after Box-Cox transformation for the eccDNA.
Mean of the hits number for all eccDNAs at chromosome list on the second field (i.e., 𝜇 of the Gaussian distribution).
Standard Deviation of the hits number for all eccDNAs at chromosome list on the second field (i.e., 𝜎 of the Gaussian distribution).
Probability greater than the hits number in the corresponding Gaussian distribution.
The score for the eccDNA (i.e., negative of the base 10 logarithm of the Probability).

*.nor files include normalized score of each category. The first 8 columns are same as *.score files, column 9 is the Z-score of the regulatory category and column 10 is the normalized score.
final.score.txt file is the final result we want. Here are the fields:

eccDNA id.
Average of normalized scores for all six regulatory categories. download here

Scoring system for mouse

Dependencies

Same as human, see above

Input files: bed file for the four regulatory categories and eccDNAs

Chromatin_access.bed download
Epigenetic_regulation.bed download
Genetic_variant.bed download
Regulatory_elements.bed download
eccDNA_core.mm10.bed download

Supplementary files: chromosome-specific density of regulatory elements

stat.Chromatin_access.bed download
stat.Epigenetic_regulation.bed download
stat.Genetic_variant.bed download
stat.Regulatory_elements.bed download

How to run

Go to the scoring system/mouse directory and set up all the dependencies
Download all the input and supplementary files listed above and decompress them
Run the run.sh shell script

Output

hits.stat.* files are annotated hits (records) count for each eccDNA in four regulatory categories. The last field is the count number.
*.score files include score for each eccDNA corresponding Gaussian mode in four regulatory categories. Here are the fields:

eccDNA id.
Chromosome to which the eccDNA belongs.
Hits number for the eccDNA.
Hits number after Box-Cox transformation for the eccDNA.
Mean of the hits number for all eccDNAs at chromosome list on the second field (i.e., 𝜇 of the Gaussian distribution).
Standard Deviation of the hits number for all eccDNAs at chromosome list on the second field (i.e., 𝜎 of the Gaussian distribution).
Probability greater than the hits number in the corresponding Gaussian distribution.
The score for the eccDNA (i.e., negative of the base 10 logarithm of the Probability).

*.nor files include normalized score of each category. The first 8 columns are same as *.score files, column 9 is the Z-score of the regulatory category and column 10 is the normalized score.
final.score.txt file is the final result we want. Here are the fields:

eccDNA id.
Average of normalized scores for all four regulatory categories. download here

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
application		application
assets		assets
scoring system		scoring system
system		system
user_guide		user_guide
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
composer.json		composer.json
index.php		index.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CircleBase V2

Scoring system for human

Dependencies

Input files: bed files for the six regulatory categories and eccDNAs

Supplementary files: chromosome-specific density of regulatory elements

How to run

Output

Scoring system for mouse

Dependencies

Input files: bed file for the four regulatory categories and eccDNAs

Supplementary files: chromosome-specific density of regulatory elements

How to run

Output

About

Uh oh!

Releases 1

Packages

Languages

License

leishenggit/CircleBase2

Folders and files

Latest commit

History

Repository files navigation

CircleBase V2

Scoring system for human

Dependencies

Input files: bed files for the six regulatory categories and eccDNAs

Supplementary files: chromosome-specific density of regulatory elements

How to run

Output

Scoring system for mouse

Dependencies

Input files: bed file for the four regulatory categories and eccDNAs

Supplementary files: chromosome-specific density of regulatory elements

How to run

Output

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages