Evolvo

GFSL-aligned evolutionary search engine with supervised PyTorch guidance.

The project reimplements the Genetic Fixed Structure Language (GFSL) as described in the “GFSL Basic” and “GFSL Extensibility” papers. Evolvo keeps the strictly numeric, compressed instruction encoding (default 7 slots, auto-sized to the maximum declared expression length) while extending the framework with learning-based direction models that bias evolution toward higher-fitness regions.

📄 The original specifications live in papers/; start with GFSL-definition.md.

Highlights

Fixed compressed genome language (default 7 slots, auto-sized to max expression length) that guarantees every instruction is valid by construction.
Cascading slot validator with progressive type activation (decimal → boolean → tensor) and context-aware enumerations.
Effective algorithm extraction so execution only touches the instructions that matter.
Targeted operation extraction to pull only the dependencies for specific result references.
Neural architecture support via RecursiveModelBuilder, enabling GFSL to describe CNN/MLP topologies.
Hybrid guidance:
- GFSLQLearningGuide learns slot choices with tabular Q-learning.
- GFSLSupervisedGuide is a PyTorch model that predicts fitness and proposes smarter mutations.
Real-time evaluation (RealTimeEvaluator) with pluggable scoring aggregators and per-test callbacks.
Rich utilities for mutation, crossover, diversity tracking, and population management.
Ad-hoc personalization via register_custom_operation, letting you graft bespoke operations into the validator/executor without forking the core.

Installation

# clone the repo
git clone https://github.com/Geckos-Ink/evolvo.git
cd evolvo

# create a virtual environment (recommended)
python3 -m venv .venv
source .venv/bin/activate

# install required runtime dependencies
pip install torch numpy

The library only depends on NumPy and PyTorch. CUDA is optional; the supervised model automatically selects CPU when a GPU is unavailable.

Quick Start

The simplest entry point is the quadratic-formula example that couples the evolver with the supervised PyTorch guide.

python example.py

Typical output (truncated):

Gen 000 | fitness=0.214583 | guide≈      nan
Gen 010 | fitness=0.781942 | guide≈0.795312
...
Best genome fitness: 0.991332
Signature: 6c71b2e9407834ab...

Effective instruction trace:
  ✓ d#0 = 1.0
  ✓ d$2 = MUL(d$0, d$0)
  ✓ d$3 = MUL(d#1, d$0)
  ✓ d$1 = ADD(d$2, d$3)
  ✓ d$1 = ADD(d$1, d#0)

The demo:

Seeds reproducible randomness.
Builds a dataset for y = 0.5x² − x + 2.
Trains the GFSLSupervisedGuide on-the-fly as new genomes are evaluated.
Evolves 60 generations with real-time progress prints.
Displays the effective GFSL instruction trace and spot-check predictions.

Want to see runtime personalization instead? Run python example.py personalization to register and execute a bespoke decimal operation.

If you prefer to start from scratch, the minimal API looks like:

from evolvo import (
    DataType, GFSLEvolver, RealTimeEvaluator, GFSLSupervisedGuide
)

evolver = GFSLEvolver(population_size=30,
                      supervised_guide=GFSLSupervisedGuide())
evolver.initialize_population("algorithm", initial_instructions=12)

for genome in evolver.population:
    genome.mark_output(DataType.DECIMAL, 1)

def fitness_fn(genome):
    return evaluator.evaluate(genome)

evolver.evolve(80, fitness_fn)
best = evolver.population[0]
print(best.to_human_readable())

Additional Examples

Practical scripts live in examples/:

examples/expression_flow.py — slot-wise builder flow, option previews, and conditional consequents.
examples/neural_flow.py — SET/CONV neural flow with a consequent RELU.
examples/operation_extraction.py — dependency-based extraction for specific result references.

Library Tour

GFSL Core

Component	Purpose
`GFSLInstruction`	10-integer representation of each instruction.
`SlotValidator`	Enforces cascading slot validity and tracks active variables/constants.
`ValueEnumerations`	Operation- and config-specific lookup tables for numeric slots.
`GFSLGenome`	Holds instructions, output declarations, signatures, and operation/effective extraction helpers.
`GFSLExecutor`	Executes only the effective instructions while supporting injected inputs.

Evaluation & Search

Component	Purpose
`RealTimeEvaluator`	Runs genomes across multiple test cases, aggregates scores, and supports callbacks per evaluation.
`RecursiveModelBuilder`	Translates neural GFSL genomes into PyTorch `nn.Module`s (e.g., CNNs).
`GFSLEvolver`	Population management (init/mutate/crossover/selection), integrates optional guidance, and tracks diversity.

Guidance Strategies

GFSLQLearningGuide — a tabular, slot-wise Q-learning agent that chooses valid slot values.
GFSLSupervisedGuide — a learnable meta-model:
- Extracts structural features from genomes (GFSLFeatureExtractor).
- Learns a regression model (GFSLSupervisedDirectionModel) that predicts future fitness.
- Biases mutation by sampling several candidate offspring and choosing the one with the best predicted fitness.

Both approaches respect GFSL’s fixed-slot constraints; the supervised guide simply tilts the mutation operator toward more promising instruction combinations.

Operation Extraction

Use the extraction helpers to get only the minimal operations needed for one or more result references. You can keep the original execution order, or choose a fixed slot ordering to obtain a stable, enumerator-ordered signature for the same operation set.

from evolvo import DataType, GFSLGenome

genome = GFSLGenome("algorithm")
# ... build instructions ...

# Extract by explicit references (string, tuple, or dict forms).
indices = genome.extract_operation_indices(["d$3", (DataType.DECIMAL, 4)], order="fixed")
instructions = genome.extract_operations(["d$3"], order="execution")

# If no result refs are provided, outputs are used by default.
genome.mark_output(DataType.DECIMAL, 3)
indices = genome.extract_operation_indices(order="fixed")

Result reference forms:

"d$3" or "b#0"
(DataType.DECIMAL, 3) or (Category.VARIABLE, DataType.DECIMAL, 3)
{"category": Category.VARIABLE, "dtype": DataType.DECIMAL, "index": 3}

Ordering options:

"execution": original instruction index order.
"fixed": stable sort by slot values (enumerator order).
"topological": dependency order with a fixed tie-break by slot values.

Practical Examples

1. Formula Discovery (Algorithms)

from evolvo import (
    DataType, GFSLExecutor, GFSLEvolver, RealTimeEvaluator
)

test_cases = [{"d$0": float(x)} for x in range(-5, 6)]
expected = [{"d$1": float(x**2 + 2*x + 1)} for x in range(-5, 6)]

evaluator = RealTimeEvaluator(test_cases, expected)
evolver = GFSLEvolver(population_size=30)
evolver.initialize_population("algorithm", initial_instructions=15)

for genome in evolver.population:
    genome.mark_output(DataType.DECIMAL, 1)

evolver.evolve(100, evaluator.evaluate)
best = evolver.population[0]

print("Effective instructions:")
for line in best.to_human_readable():
    if line.startswith("✓"):
        print(" ", line)

executor = GFSLExecutor()
print(executor.execute(best, {"d$0": 3.0}))

2. Neural Architecture Assembly

import torch
from evolvo import (
    Category, ConfigProperty, DataType, GFSLGenome,
    GFSLInstruction, Operation, RecursiveModelBuilder, pack_type_index
)

genome = GFSLGenome("neural")
genome.validator.activate_type(DataType.TENSOR)

# SET CHANNELS = 64
genome.add_instruction(GFSLInstruction([
    Category.NONE, 0, Operation.SET,
    Category.CONFIG, ConfigProperty.CHANNELS,
    Category.VALUE, 5,
]))

# t$0 = CONV(t$0) followed by RELU
genome.add_instruction(GFSLInstruction([
    Category.VARIABLE, pack_type_index(DataType.TENSOR, 0), Operation.CONV,
    Category.VARIABLE, pack_type_index(DataType.TENSOR, 0),
    Category.NONE, 0,
]))
genome.add_instruction(GFSLInstruction([
    Category.VARIABLE, pack_type_index(DataType.TENSOR, 0), Operation.RELU,
    Category.VARIABLE, pack_type_index(DataType.TENSOR, 0),
    Category.NONE, 0,
]))

builder = RecursiveModelBuilder()
model = builder.build_from_genome(genome, (3, 32, 32))

x = torch.randn(1, 3, 32, 32)
print(model(x).shape)

3. Ad-hoc Operations (Personalization)

Use register_custom_operation when you need a one-off opcode without editing the core library. The helper wires your function into the slot validator, executor, and human-readable traces.

from evolvo import (
    Category, DataType, GFSLExecutor,
    GFSLGenome, GFSLInstruction, pack_type_index, register_custom_operation,
)

opcode = register_custom_operation(
    "affine_bias",
    target_type=DataType.DECIMAL,
    function=lambda value, bias, context=None: float(value) * 1.5 + float(bias or 0.0),
    arity=2,
    value_enumeration=[-1.0, 0.0, 0.5, 2.0],  # inline VALUE slots pull from this list
    doc="Scales the input and adds a selectable bias term.",
)

genome = GFSLGenome("algorithm")
genome.add_instruction(GFSLInstruction([
    Category.VARIABLE, pack_type_index(DataType.DECIMAL, 1), opcode,
    Category.VARIABLE, pack_type_index(DataType.DECIMAL, 0),
    Category.VALUE, 3,  # chooses bias=2.0
]))
genome.mark_output(DataType.DECIMAL, 1)

executor = GFSLExecutor()
print(executor.execute(genome, {"d$0": 2.0}))  # {'d$1': 5.0}

The optional context keyword argument receives {"executor": ..., "instruction": ...} so custom functions can peek at runtime state when needed.

4. Supervised Guided Evolution (From `example.py`)

from evolvo import (
    DataType, GFSLExecutor, GFSLEvolver,
    GFSLSupervisedGuide, RealTimeEvaluator
)

guide = GFSLSupervisedGuide(hidden_layers=[256, 128],
                            candidate_pool=4, min_buffer=64)
evolver = GFSLEvolver(population_size=32, supervised_guide=guide)
evolver.initialize_population("algorithm", initial_instructions=12)

for genome in evolver.population:
    genome.mark_output(DataType.DECIMAL, 1)

evolver.evolve(60, evaluator.evaluate)
best = evolver.population[0]
print(best.fitness, best.to_human_readable())

Customisation & Extensibility

Operations / Data types — extend Operation and DataType enums; the slot validator automatically respects new options.
Enumerations — add or override entries in ValueEnumerations to open new discrete value sets (learning rates, optimizer choices, etc.).
Guidance — swap in your own mutation proposal system by implementing a propose_mutation method similar to GFSLSupervisedGuide.
Scoring — pass a custom score_aggregator into RealTimeEvaluator to change how per-case fitness values are combined.

Because every slot is enumerated and validated, extending the system never introduces invalid states—new concepts simply become additional discrete options inside the GFSL language.

Running the Legacy Examples

To explore the classic demonstrations (without the supervised guide), run:

python -c "from evolvo import example_formula_discovery; example_formula_discovery()"
python -c "from evolvo import example_neural_architecture_search; example_neural_architecture_search()"

They remain useful for validating the baseline evolver and seeing the raw GFSL output traces.

Roadmap Ideas

Gradient-informed mutation proposals (hybrid neuro-evolution).
Additional tensor ops (attention, modern normalization layers).
Benchmarks comparing supervised guidance vs. unguided baselines.
Exporters for ONNX / TorchScript from GFSL neural genomes.

References

GFSL Basic — foundational description of the fixed-slot language.
GFSL Extensibility — discusses type activation, enumerations, and advanced operations.
️papers/ — contains PDF/Markdown reproductions of the above papers along with related notes.

Have fun evolving! Contributions and experiment reports are welcome—open an issue or PR if you push the framework in new directions.***

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
examples		examples
papers		papers
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
evolvo.py		evolvo.py
example.py		example.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Evolvo

Highlights

Installation

Quick Start

Additional Examples

Library Tour

GFSL Core

Evaluation & Search

Guidance Strategies

Operation Extraction

Practical Examples

1. Formula Discovery (Algorithms)

2. Neural Architecture Assembly

3. Ad-hoc Operations (Personalization)

4. Supervised Guided Evolution (From `example.py`)

Customisation & Extensibility

Running the Legacy Examples

Roadmap Ideas

References

About

Uh oh!

Releases

Packages

Languages

License

Geckos-Ink/evolvo

Folders and files

Latest commit

History

Repository files navigation

Evolvo

Highlights

Installation

Quick Start

Additional Examples

Library Tour

GFSL Core

Evaluation & Search

Guidance Strategies

Operation Extraction

Practical Examples

1. Formula Discovery (Algorithms)

2. Neural Architecture Assembly

3. Ad-hoc Operations (Personalization)

4. Supervised Guided Evolution (From example.py)

Customisation & Extensibility

Running the Legacy Examples

Roadmap Ideas

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

4. Supervised Guided Evolution (From `example.py`)

Packages