SUBSEQ.BIO
DOCS-EVO 2

Evo 2

DNA language model jobs for sequence generation, likelihood scoring, and embedding extraction.

How SubSeq Runs It

Input

Use typed DNA sequence text, one FASTA file, or a folder of FASTA files. Sequences should use A, C, G, T, and N.

In the job form, choose the operation first, then choose Sequence Input. If you choose a FASTA file or folder, the next source selector only asks where that file or folder comes from: paste/upload, dataset, or previous job output.

>example
ACGTACGTACGTACGT

Example Arguments

Generate

generate
--sequence=ACGTACGTACGT
--name=prompt
--n-tokens=400
--num-samples=1
--temperature=1.0
--top-k=4

Score

score
--fasta=/inputs/sequences.fasta
--reduce-method=mean
--average-reverse-complement

Embeddings

embed
--input-folder=/inputs/fastas
--recursive
--pool=mean

Arguments

Global

Input

Generation

Scoring

Embeddings

Outputs

Submit

Queue a run from New Job -> Evo 2.