PyValentin is a sophisticated matchmaking system that uses multi-dimensional distance calculations and compatibility filtering to create optimal pairs from survey responses. The system processes raw survey data through several stages, applying mathematical models to quantify compatibility.

Features
Setup
Usage
The Beautiful Beautifly Simple Math Behind PyValentin
File Structure
Configuration
Input Format
Output Files
Customization Guide
Technical Details
Troubleshooting
Contact & Support
License

Features

Multi-dimensional compatibility analysis
Customizable gender/preference filtering
Quality vs. quantity optimization
Interactive GUI with progress tracking
Drag-and-drop file support
Comprehensive results output

Setup and Installation

You Only Need To Download the .ZIP

Ensure Python 3.8+ is installed
Install dependencies:
```
pip install tkinterdnd2 numpy
```
Place your survey data in CSV format
Configure your Config.json and Filter.json files

Usage

Launch the application:
```
python main.py
```
Select your input files:
- Survey responses (CSV)
- Configuration file (JSON)
- Filter rules (JSON)
Adjust the quality-quantity slider
Click "Process Files"
Check the genR folder for results

The Mathematics Behind PyValentin

Current Pair Optimization

For quality weight w ∈ [0,1]:

P(A,B) = w * S(A,B) + (1-w) * M(A,B)
where M(A,B) is 1 if creates new pair, 0 otherwise

1. Data Normalization

Before processing, all categorical survey responses are converted to numerical values using the Config.json mapping. This creates a consistent numerical space for calculations.

2. Distance Calculation

For each pair of users (i,j), we calculate a multi-dimensional Euclidean distance:

Dw(A,B) = √(Σ(wi(Ai - Bi)²)) / √(Σwi)
where wi is importance weight of question i

This produces a distance matrix D where D[i,j] represents how different two users are across all responses.

3. Similarity Transformation

The distance matrix is transformed into a similarity matrix using:

S(A,B) = 1 / (1 + D(A,B))
Range: (0,1] where 1 is perfect match

This creates a normalized similarity score where:

1.0 = perfect match
0.0 = complete mismatch

4. Preference Filtering

The system applies a boolean matrix F where:

F(A,B) = (A.attracted_to ∈ B.gender_identity AND
          B.attracted_to ∈ A.gender_identity) OR
         (8 ∈ {A.gender_identity, A.attracted_to, B.gender_identity, B.attracted_to})

5. Optimal Pairing Algorithm

The system uses a modified stable marriage algorithm with the following steps:

Sort users by number of potential matches
For each user i:
- Find top N matches based on quality weight
- Select best available match j
- Remove both i and j from available pool

Quality weight affects match selection:

High (>0.5): Selects from top 25% of matches
Low (<0.5): Considers up to 75% of matches

6. Second Pass Optimization

For remaining unmatched users:

Create a graph of mutual matches
Find maximal matching using a greedy algorithm
Optimize for global satisfaction using local improvements

Configuration

Config.json

Defines mappings from survey responses to numerical values:

{
     "Response Text": "Numerical Value",

}

Filter.json

Defines preference matching rules:

{
     "filterables": {
          "1": "Male",
          "2": "Female",

     },
     "filters": {
          "5": ["1", "2", "3"], 
          "4": ["1", "2"],      

     }
}

Input Format

Required CSV columns:

Timestamp
Email
Gender (a)
Attracted to (b)
Question responses...

Example:

Timestamp,Email,Gender,Attracted To,Q1,Q2,...
2024-01-01,user@example.com,Male,Female,Response1,Response2,...

Output Files

modified_csv.csv: Normalized survey data
processed_distances.csv: Distance matrix
similarity_list.csv: Similarity scores
filtered_similarity_list.csv: Filtered matches
optimal_pairs.csv: Final pairings
unpaired_entries.csv: Unmatched users

Customization Guide

Modifying Match Criteria

Update Config.json with new response mappings
Modify Filter.JSON matching rules
Adjust weights in Ski.py calculate_distances() function

Adding New Questions

Add response mappings to Config.json
Update CSV processing in FixCSV.py if needed
Modify distance calculation in Ski.py

Custom Filtering Rules

Modify Filter.JSON:

{
    "filterables": {
        "value": "label"
    },
    "filters": {
        "seeker_value": ["acceptable_values"]
    }
}

Technical Details

Distance Calculation

Uses normalized Euclidean distance
Configurable weights per question
Range: 0 (identical) to 1 (maximum difference)

Matching Algorithm

Converts responses to numerical values
Calculates distance matrix
Generates similarity scores
Applies filtering rules
Handles edge cases

Performance

O(n²) complexity for n participants
Memory usage: ~100MB for 1000 participants
Processing time: ~1-2 seconds per 100 participants

Troubleshooting

Common Issues

Missing dependencies
- Run pip install tkinterdnd2 numpy
- Check Python version (3.8+ required)
File format errors
- Verify CSV column order
- Check JSON syntax
- Ensure UTF-8 encoding
No matches found
- Verify filter rules
- Check response mappings
- Confirm gender/attraction values

Debug Mode

Add debug logging:

import logging
logging.basicConfig(level=logging.DEBUG)

Contact & Support

Contact me for info on this project at:
Nagusame CS on Github

License

This project is licensed under the GNU General Public License v3.0 - see below for details:

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program. If not, see <https://www.gnu.org/licenses/>.

Assets 3

Releases: NagusameCS/PyValentin

9.00.0 - UNSTABLE

Uh oh!

8.00.0 - UNSTABLE

Uh oh!

7.00.0 - STABLE

Uh oh!

6.50.0

Uh oh!

PyValentine 6.00.0

Uh oh!

PyValentine 5.83.0

Uh oh!

PyValentin 5.82.2

Uh oh!

PyValentin Version 4.03

PyValentin - Advanced Matchmaking System

Overview

Table of Contents

Features

Setup and Installation

You Only Need To Download the .ZIP

Usage

The Mathematics Behind PyValentin

Current Pair Optimization

1. Data Normalization

2. Distance Calculation

3. Similarity Transformation

4. Preference Filtering

5. Optimal Pairing Algorithm

6. Second Pass Optimization

Configuration

Config.json

Filter.json

Input Format

Output Files

Customization Guide

Modifying Match Criteria

Adding New Questions

Custom Filtering Rules

Technical Details

Distance Calculation

Matching Algorithm

Performance

Troubleshooting

Common Issues

Debug Mode

Contact & Support

License

Uh oh!