SomaLogic Mapping
somalogic_mapping.RmdSomaLogic Mapping Dataset
This dataset provides a mapping of SomaLogic aptamers to their corresponding genomic coordinates and gene identifiers. The data is derived from the SomaLogic v5.0 Plasma ADAT file.
Data Generation Summary
The mapping was generated using the following steps:
- Input Data: Raw data was retrieved from the SomaLogic GitHub repository (Example Data v5.0 Plasma ADAT).
-
Formatting:
- Split multiple entries for UniProt, Entrez Gene ID, and Entrez Gene Symbol.
- Filtered for Human organism only.
- Filtered for Protein type (removing controls).
- Excluded all entries flagged as “internal use only”.
-
Mapping: UniProt IDs, Entrez IDs, and HGNC symbols
were mapped to Ensembl Gene IDs using
biomaRt(Ensembl datasethsapiens_gene_ensembl). -
Filtering:
- Only protein_coding genes were retained.
- Only standard chromosomes (1-22, X, Y) were retained.
- Mappings were cross-referenced to ensure symbol consistency.
-
Genomic Positions: Start and End positions were
retrieved for both hg19 and hg38
builds using
TxDb.Hsapiens.UCSC.hg19.knownGeneandTxDb.Hsapiens.UCSC.hg38.knownGene.
Column Descriptions
| Column | Description |
|---|---|
SeqId |
SomaLogic Sequence ID |
SomaId |
SomaLogic Aptamer ID |
UNIPROT |
UniProt ID provided by SomaLogic |
Target |
Target gene symbol (HGNC) |
TargetFullName |
Full protein name |
EntrezGeneID |
Entrez Gene ID provided by SomaLogic |
EntrezGeneSymbol |
Entrez Gene Symbol provided by SomaLogic |
uniprot_gn_id |
Mapped UniProt gene ID |
uniprot_gn_symbol |
Mapped UniProt gene symbol |
entrezgene_id |
Mapped Entrez gene ID |
entrezgene_accession |
Mapped Entrez accession |
hgnc_id |
Mapped HGNC ID |
hgnc_symbol |
Mapped HGNC symbol |
ensembl_gene_id |
Mapped Ensembl Gene ID |
external_gene_name |
External gene name |
gene_biotype |
Gene biotype (e.g. protein_coding) |
CHR |
Chromosome name |
START_hg19 |
Start position (hg19) |
END_hg19 |
End position (hg19) |
strand_hg19 |
Strand (hg19) |
START_hg38 |
Start position (hg38) |
END_hg38 |
End position (hg38) |
strand_hg38 |
Strand (hg38) |