DNA Sequence
Deoxyribonucleic acid sequence in IUPAC notation (letters: A, T, G, C, and ambiguity codes).
representation.scientific.dna_sequence
Domain
representation
Category
scientific
Casts to
VARCHAR
Scope
Universal
Try it
CLI
$ finetype infer -i "ATGCAGC" --mode column
→ representation.scientific.dna_sequence
DuckDB
Detect
SELECT ft_infer('ATGCAGC');
-- → 'representation.scientific.dna_sequence'
Cast expression
UPPER(CAST({col} AS VARCHAR))
Safe cast pipeline
-- Normalise and cast in one step
SELECT TRY_CAST(ft_cast(my_column) AS VARCHAR) AS clean_value
FROM my_table
WHERE ft_infer(my_column) = 'representation.scientific.dna_sequence';
Struct Expansion
Expression
gc_content: CAST(REGEXP_COUNT({col}, '[GC]') AS DOUBLE) / LENGTH({col})
length: LENGTH({col})
JSON Schema
finetype taxonomy representation.scientific.dna_sequence -o json-schema
{
  "$id": "https://meridian.online/schemas/representation.scientific.dna_sequence",
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "description": "Deoxyribonucleic acid sequence in IUPAC notation (letters: A, T, G, C, and ambiguity codes).",
  "examples": [
    "ATGCAGC",
    "GCTAGCTAGCTAG",
    "ATGATGATG"
  ],
  "pattern": "^[ATGCRYSWKMBDHVN]+$",
  "title": "DNA Sequence",
  "type": "string",
  "x-finetype-label": "representation.scientific.dna_sequence",
  "x-finetype-pii": false
}
Examples
ATGCAGC
GCTAGCTAGCTAG
ATGATGATG
Aliases
dna
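
For readers who want to check values outside DuckDB, the schema pattern, the cast expression, and the two struct expansion fields described on this page can be approximated in plain Python. This is a minimal sketch, not FineType's own implementation: the function names `normalise`, `is_dna_sequence`, and `expand` are hypothetical, and the only things taken from this page are the character class from the JSON Schema `pattern`, the uppercasing from the cast expression, and the `gc_content` and `length` definitions.

```python
import re

# Character class copied from the JSON Schema "pattern" on this page:
# A, T, G, C plus the IUPAC ambiguity codes, uppercase only.
DNA_PATTERN = re.compile(r"[ATGCRYSWKMBDHVN]+")


def normalise(value) -> str:
    """Approximate the cast expression UPPER(CAST({col} AS VARCHAR))."""
    return str(value).upper()


def is_dna_sequence(value: str) -> bool:
    """Return True if the whole string matches the schema pattern.

    fullmatch is used so a trailing newline is rejected, which a bare
    `$` anchor in Python's re module would otherwise tolerate.
    """
    return DNA_PATTERN.fullmatch(value) is not None


def expand(value: str) -> dict:
    """Approximate the struct expansion: gc_content and length.

    gc_content counts only literal G and C characters, as the
    '[GC]' expression on this page does.
    """
    if not value:
        raise ValueError("cannot compute gc_content of an empty sequence")
    gc_count = sum(1 for base in value if base in "GC")
    return {"gc_content": gc_count / len(value), "length": len(value)}


if __name__ == "__main__":
    for raw in ["ATGCAGC", "gctagctagctag", "ATGX"]:
        seq = normalise(raw)
        if is_dna_sequence(seq):
            print(seq, expand(seq))
        else:
            print(seq, "does not match the dna_sequence pattern")
```

Note that lowercase input only matches after normalisation, because the schema pattern is uppercase only. The ambiguity code S stands for "G or C", but this sketch, like the expression it mirrors, does not count it toward `gc_content`.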