SoyTFtarget

An Integrated Database for Predicting Transcription Factor and Target Gene Regulatory Relationships.


Downloadable data



1. Data Information

SoybeanTFtarget compiled 562 unique RNA-seq datasets from the NCBI GEO database, focusing on whole-based samples without biological replicates. From these, 25 representative samples were selected, one from each tissue, to create the tissue-based sample set.
information_whole.txt
TSV-file, 124K

information_tissue.txt
TSV-file, 8.0K

2. Promoter Sequences

Promoter sequences of soybean genes.
For each soybean gene, the 2000 bp immediately upstream of its first CDS were taken as the promoter. The sequences are provided in FASTA format.
promoter_soybean.fa
FASTA-file, 102M
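
The promoter definition above can be illustrated with a minimal sketch. It is not the pipeline used to build promoter_soybean.fa. The function name, the 0-based half-open coordinates, the reverse-complementing of minus-strand genes, and the truncation at chromosome ends are all assumptions made for illustration.

```python
# Hypothetical helper: the coordinate convention and strand handling are assumptions.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def promoter_upstream(chrom_seq: str, cds_start: int, cds_end: int,
                      strand: str, length: int = 2000) -> str:
    """Return up to `length` bp upstream of the first CDS.

    cds_start and cds_end give the first CDS as a 0-based, half-open
    interval on the forward strand. "Upstream" is relative to the gene,
    so for a minus-strand gene the region lies after cds_end and is
    reverse-complemented. The region is truncated at chromosome ends.
    """
    if strand == "+":
        begin = max(0, cds_start - length)
        return chrom_seq[begin:cds_start]
    if strand == "-":
        end = min(len(chrom_seq), cds_end + length)
        return reverse_complement(chrom_seq[cds_end:end])
    raise ValueError(f"unknown strand: {strand!r}")


# Toy 16 bp sequence with a 4 bp "promoter" so the result is easy to check.
toy = "AAAATTTTGACGCCCC"
print(promoter_upstream(toy, 8, 12, "+", length=4))  # TTTT
print(promoter_upstream(toy, 4, 8, "-", length=4))   # CGTC
```

With the default length=2000 the same call would return the 2000 bp window described above, or a shorter sequence when the gene lies near a chromosome end.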

3. TF information

This file mainly corresponds to the information shown in the BROWSE module.
Description:
column 1: Transcription factor ID
column 2: Transcription factor gene ID
column 3: Transcription factor annotation
column 4: Transcription factor motif type
    type Ⅰ represents a TF with a manually curated, non-redundant, high-quality set of binding motifs, which includes motifs from PlantTFdb and motifs obtained from DAP-seq.
    type Ⅱ represents a TF that has sequence similarity (e-value < 1e-4) to a type Ⅰ TF in the same family and shares that TF's motif.
    type Ⅲ represents a TF with no binding motif (e-value > 1e-4).
column 5: Transcription factor family
column 6: ID of the homologous TF, if the TF is homologous to a type Ⅰ TF
column 7: e-value
TF_list.txt
TSV-file, 628K
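
As a rough sketch of how the seven columns described above might be read, the following parses tab-separated rows into named records and tallies the motif types. It assumes the file has no header row and that empty trailing fields may be missing. The record and function names are hypothetical, and the sample rows are placeholders, not real entries from TF_list.txt. The exact spelling of the motif type values in column 4 is not documented here, so the code simply counts whatever strings it finds.

```python
import csv
import io
from collections import Counter
from typing import NamedTuple


class TFRecord(NamedTuple):
    """One row of the TF list, in the column order described above."""
    tf_id: str
    gene_id: str
    annotation: str
    motif_type: str
    family: str
    homolog_id: str   # may be empty when no homologous TF is listed
    evalue: str       # kept as text because it may be empty


def read_tf_list(handle) -> list[TFRecord]:
    """Parse tab-separated rows; assumes no header row (an assumption)."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    records = []
    for row in reader:
        if not row:
            continue  # skip blank lines
        row = (row + [""] * 7)[:7]  # pad rows whose trailing fields are missing
        records.append(TFRecord(*row))
    return records


# Placeholder rows for illustration only; not real data from the file.
sample = (
    "TF_A\tGENE_A\tplaceholder annotation\tI\tFAM1\t\t\n"
    "TF_B\tGENE_B\tplaceholder annotation\tII\tFAM1\tTF_A\t1e-20\n"
)
records = read_tf_list(io.StringIO(sample))
type_counts = Counter(r.motif_type for r in records)
print(type_counts)
```

To read the downloaded file instead of the sample, pass an open file handle, for example `open("TF_list.txt", newline="", encoding="utf-8")`. If the file turns out to have a header row, drop the first record.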

4. Gene Expression matrix of 562 unique RNA-seq samples

Includes both transcription factors and other coding genes.

5. Gene Expression matrix of 25 tissue-based samples

Includes both transcription factors and other coding genes.