Downloadable data
1. Data Information
SoybeanTFtarget compiled 562 unique RNA-seq datasets from the NCBI GEO database, focusing on whole-based samples without biological replicates. From these, 25 representative samples were selected, one per tissue, to form a tissue-based sample set.
2. Promoter Sequences
Promoter sequences of soybean genes.
For each soybean gene, we took the 2000 bp immediately upstream of its first CDS as the promoter. The sequences are provided in FASTA format.
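The promoter definition above can be illustrated with a minimal Python sketch. It is not the pipeline used to build the download; the function names, the 1-based inclusive coordinate convention, and the handling of minus-strand genes (reverse-complementing the region downstream of the CDS in genome coordinates) are assumptions made for illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


def extract_promoter(chrom_seq, cds_start, cds_end, strand, length=2000):
    """Return up to `length` bp upstream of a gene's first CDS.

    Assumptions (hypothetical, not taken from the source):
    - cds_start and cds_end are 1-based inclusive genome coordinates
      of the first CDS in transcript order;
    - the promoter is reported in the gene's own orientation;
    - the region is truncated at chromosome ends.
    """
    if strand == "+":
        end = cds_start - 1              # 0-based exclusive end, just before the CDS
        start = max(0, end - length)
        return chrom_seq[start:end]
    if strand == "-":
        start = cds_end                  # 0-based index of the base after the CDS
        end = min(len(chrom_seq), start + length)
        return reverse_complement(chrom_seq[start:end])
    raise ValueError("strand must be '+' or '-'")
```

For a plus-strand gene whose first CDS starts at position 5001, `extract_promoter(chrom_seq, 5001, 5300, "+")` would return bases 3001 to 5000 of the chromosome.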
3. TF information
This file mainly corresponds to the information shown in the BROWSE module.
Description:
column 1: Transcription factor ID
column 2: Transcription factor gene ID
column 3: Transcription factor annotation
column 4: Transcription factor motif type
Type Ⅰ: TFs with a manually curated, non-redundant, high-quality set of TF binding motifs, drawn from both PlantTFDB and DAP-seq.
Type Ⅱ: TFs in the same family as a Type Ⅰ TF that show sequence similarity to it (e-value < 1e-4); they share that Type Ⅰ TF's motif.
Type Ⅲ: TFs with no binding motifs (e-value > 1e-4).
column 5: Transcription factor family
column 6: ID of the homologous Type Ⅰ TF, if the TF has a Type Ⅰ homolog
column 7: E-value of the sequence similarity to that homolog
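As a rough illustration of the column layout above, the following Python sketch parses one record into named fields. The file's delimiter, the absence of a header row, and the way missing values are written are not stated here, so the tab separator, the missing-value markers, and all names in the code (`TFRecord`, `parse_tf_line`, `shares_type1_motif`) are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TFRecord:
    tf_id: str                  # column 1
    gene_id: str                # column 2
    annotation: str             # column 3
    motif_type: str             # column 4
    family: str                 # column 5
    homolog_id: Optional[str]   # column 6, may be absent
    evalue: Optional[float]     # column 7, may be absent


# Assumed markers for an empty field; the real file may use others.
MISSING = {"", "NA", "-"}


def parse_tf_line(line, sep="\t"):
    """Parse one line of the TF information file (assumed tab-separated)."""
    fields = line.rstrip("\n").split(sep)
    fields += [""] * (7 - len(fields))          # pad short lines
    tf_id, gene_id, annotation, motif_type, family, homolog, evalue = fields[:7]
    return TFRecord(
        tf_id=tf_id,
        gene_id=gene_id,
        annotation=annotation,
        motif_type=motif_type,
        family=family,
        homolog_id=None if homolog in MISSING else homolog,
        evalue=None if evalue in MISSING else float(evalue),
    )


def shares_type1_motif(evalue, threshold=1e-4):
    """Apply the e-value cutoff described for type II TFs."""
    return evalue is not None and evalue < threshold
```

The identifiers passed to `parse_tf_line` in any test or demo are placeholders; real IDs should be taken from the downloaded file.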
4. Gene Expression matrix of 562 unique RNA-seq samples
Includes both transcription factors and other (non-TF) coding genes.
5. Gene Expression matrix of 25 tissue-based samples
Includes both transcription factors and other (non-TF) coding genes.