Frequently Asked Questions

General Questions

What is U-Probe?

U-Probe is a comprehensive tool for designing DNA/RNA probes for various molecular biology applications including FISH, PCR, and sequencing. It automates the entire workflow from target selection to quality-filtered probe generation.

What makes U-Probe different from other probe design tools?

End-to-end workflow: Complete automation from genome to final probes
Highly configurable: YAML-based configuration for any probe design
Quality-focused: Comprehensive attribute calculation and filtering
Flexible design: Support for complex multi-part probes
Multiple applications: FISH, PCR, sequencing, and custom designs
Python API: Programmatic access for pipeline integration

Installation and Setup

Which Python versions are supported?

U-Probe supports Python 3.9 and higher. Python 3.11 is recommended for best performance.

Do I need to install external tools?

Yes, U-Probe requires:

Bowtie2 for sequence alignment
BLAST+ for similarity searches
Jellyfish (optional) for k-mer counting

See the installation guide for details.

Can I use U-Probe on Windows?

Yes, but with limitations. The external bioinformatics tools (Bowtie2, BLAST) need to be installed separately. We recommend using Windows Subsystem for Linux (WSL) or Docker for the best experience.

Can I run U-Probe without installing Python?

Yes! You can create a standalone executable using PyInstaller. See the installation guide for instructions.

Configuration

How do I find the correct gene names for my organism?

Gene names must match those in your GTF annotation file. To find available names:

bash

# Search for a specific gene
grep -i "GAPDH" /path/to/annotation.gtf

# List all gene names
awk '$3=="gene"' /path/to/annotation.gtf | \
grep -o 'gene_name "[^"]*"' | sort | uniq

Can I design probes for multiple species?

Yes! Create separate genome configurations for each species:

yaml

# genomes.yaml
human_hg38:
  fasta: "/data/human/hg38.fa"
  gtf: "/data/human/hg38.gtf"

mouse_mm39:
  fasta: "/data/mouse/mm39.fa" 
  gtf: "/data/mouse/mm39.gtf"

Then use separate protocols for each species or design cross-species probes with appropriate attributes.

How do I design probes for custom genomic regions?

Use coordinate-based extraction:

yaml

extracts:
  target_region:
    source: "genome"
    length: 200
    coordinates:
      - "chr1:1000000-1002000"
      - "chr2:500000-501000"

What's the difference between "exon", "gene", and "genome" extraction?

exon: Extracts from annotated exonic regions only (spliced sequences)
gene: Extracts from entire gene regions including introns
genome: Extracts from specified genomic coordinates

Choose based on your application:

FISH probes: usually "exon" or "gene"
Genomic PCR: "genome" with coordinates
RNA detection: "exon"

Probe Design

How do I design FISH probes?

Basic FISH probe configuration:

yaml

probes:
  fish_probe:
    template: "{target_binding}TTTTTT{fluorophore_site}"
    parts:
      target_binding:
        length: 25
        expr: "rc(target_region[0:25])"
      fluorophore_site:
        expr: "encoding[target]['fluorophore']"

See examples for complete FISH configurations.

How do I design PCR primers?

For PCR primer pairs:

yaml

probes:
  forward_primer:
    template: "{seq}"
    parts:
      seq:
        length: 22
        expr: "target_region[0:22]"
  
  reverse_primer:
    template: "{seq}"
    parts:
      seq:
        length: 22
        expr: "rc(target_region[-22:])"

Can I use custom sequences in my probes?

Yes! Use literal sequences in quotes:

yaml

probes:
  custom_probe:
    template: "{primer}{target_binding}{adapter}"
    parts:
      primer:
        expr: "'ACGTACGT'"  # Fixed sequence
      target_binding:
        length: 25
        expr: "target_region[0:25]"
      adapter:
        expr: "'TGCATGCA'"

How do I reference other probes in expressions?

Use the probe name in expressions:

yaml

probes:
  probe_1:
    template: "{seq}"
    parts:
      seq:
        expr: "target_region[0:20]"
  
  probe_2:
    template: "{partial}"
    parts:
      partial:
        expr: "probe_1[5:15]"  # Uses part of probe_1

Quality Control

What quality metrics should I use?

Essential attributes for most applications:

yaml

attributes:
  gc_content:
    target: main_probe
    type: gc_content
  melting_temp:
    target: main_probe
    type: annealing_temperature
  off_targets:
    target: main_probe
    type: n_mapped_genes
    aligner: bowtie2
  secondary_structure:
    target: main_probe
    type: self_match

How do I set appropriate filter thresholds?

Start with wide ranges and tighten based on results:

yaml

post_process:
  filters:
    # Start relaxed
    gc_content:
      condition: "gc_content >= 0.3 & gc_content <= 0.7"
    
    # Then tighten for final design
    # gc_content:
    #   condition: "gc_content >= 0.45 & gc_content <= 0.55"

Use the --raw flag to examine distributions before setting final thresholds.

Why are all my probes being filtered out?

Common causes:

Too strict filters: Relax conditions temporarily
Failed attribute calculations: Check for missing indices or files
Inappropriate probe design: Verify expressions are valid
Target region issues: Try different extraction parameters

Use uprobe --verbose run -p protocol.yaml -g genomes.yaml --raw to diagnose.

Performance

How can I speed up probe design?

Increase threads: uprobe run -t 16
Use efficient extraction: "exon" is faster than "gene"
Reduce expensive attributes: Skip fold_score and kmer_count for initial designs
Process in batches: Split large target lists

yaml

# Fast configuration
extracts:
  target_region:
    source: "exon"
    length: 100

attributes:
  # Keep only essential fast attributes
  gc_content:
    target: main_probe
    type: gc_content

How much memory does U-Probe need?

Memory usage depends on:

Genome size (human genome ~8GB for indices)
Number of targets (1000 genes ~1-2GB)
Sequence lengths and overlap
Attribute calculations

For large genomes with many targets, consider 16GB+ RAM.

Can I run U-Probe on a cluster?

Yes! U-Probe is designed for cluster usage:

bash

# SLURM example
#SBATCH --cpus-per-task=16
#SBATCH --mem=32G

uprobe run -p protocol.yaml -g genomes.yaml -t 16

Output and Results

What do the output columns mean?

Standard columns include:

target: Target gene identifier
target_region: Extracted genomic sequence
[probe_name]: Designed probe sequences
[attribute_name]: Calculated quality metrics
chromosome, start, end: Genomic coordinates

How do I interpret quality metrics?

Metric	Good Range	Notes
gc_content	0.4-0.6	Higher = stronger binding, harder to denature
melting_temp	50-65°C	Depends on application temperature
self_match	<0.7	Lower = less secondary structure
n_mapped_genes	≤5	Lower = more specific

Can I export results in other formats?

U-Probe outputs CSV files which can be easily converted:

python

import pandas as pd

# Read CSV
df = pd.read_csv('results/probes.csv')

# Export to other formats
df.to_excel('probes.xlsx', index=False)
df.to_json('probes.json', orient='records')
df.to_parquet('probes.parquet')

How do I select the best probes from results?

python

import pandas as pd

df = pd.read_csv('results/probes.csv')

# Top 5 probes per target by melting temperature
best_probes = (df.sort_values(['target', 'melting_temp'])
                .groupby('target')
                .head(5))

# Filter by multiple criteria
high_quality = df[
    (df['gc_content'] >= 0.45) & 
    (df['gc_content'] <= 0.55) &
    (df['melting_temp'] >= 55) &
    (df['off_targets'] <= 3)
]

Integration

Can I use U-Probe in my Python pipeline?

Yes! Use the Python API:

python

from uprobe import UProbeAPI

uprobe = UProbeAPI(protocol_dict, genomes_dict, output_dir)
results = uprobe.run_workflow()

# Process results with pandas
filtered_results = results[results['gc_content'] > 0.5]

How do I integrate U-Probe with other tools?

U-Probe works well with:

Primer3: Import U-Probe designs for primer optimization
OligoAnalyzer: Validate secondary structures
BLAST: Additional specificity checking
Custom pipelines: Use CSV outputs as input to downstream tools

Can I run U-Probe in Docker?

Yes! Create a Dockerfile:

dockerfile

FROM python:3.11

RUN apt-get update && apt-get install -y bowtie2 ncbi-blast+

COPY . /app
WORKDIR /app
RUN pip install .

ENTRYPOINT ["uprobe"]

Applications

What applications is U-Probe suitable for?

FISH: Fluorescence in situ hybridization probes
PCR: Primer design for amplification
qPCR: Quantitative PCR probes and primers
Sequencing: Capture probes for targeted sequencing
Microarrays: Oligonucleotide probe design
Biosensors: Detection probe design
Custom: Any application requiring designed oligonucleotides

Can U-Probe design riboprobes for RNA ISH?

While U-Probe designs DNA probes, you can adapt the output for riboprobe synthesis by:

Designing DNA probes with U-Probe
Adding T7/T3/SP6 promoter sequences
Using the sequences for in vitro transcription

Is U-Probe suitable for clinical applications?

U-Probe is a research tool. For clinical applications:

Validate all designs experimentally
Follow relevant regulatory guidelines
Consider using established clinical probe sets
Implement additional quality controls

Getting More Help

Where can I find more examples?

Check the examples section
Browse the GitHub repository examples folder
Look at test configurations in tests/data/

How do I contribute to U-Probe?

See the contributing guide for:

Reporting bugs
Requesting features
Contributing code
Improving documentation

Where do I report bugs or request features?

Bugs: GitHub Issues
Feature requests: GitHub Discussions
General questions: GitHub Discussions

How often is U-Probe updated?

U-Probe is actively maintained with:

Bug fixes as needed
Regular feature updates
Security patches
Documentation improvements

Check the changelog for recent updates.

Can I get commercial support?

U-Probe is open-source software. For commercial support or custom development:

Contact the development team through GitHub
Consider hiring contributors for consulting
Explore academic collaborations

Troubleshooting Questions

Why is my installation failing?

Common solutions:

Update pip: pip install --upgrade pip
Use virtual environment
Install system dependencies first
Check Python version (≥3.9 required)

See troubleshooting for detailed help.

Why can't U-Probe find my genes?

Check gene names match GTF file exactly
Try different name formats (symbol, Ensembl ID, etc.)
Verify GTF file format and encoding
Use case-sensitive matching

Why is probe design so slow?

Reduce target list size
Increase thread count
Skip expensive attributes initially
Use faster extraction methods
Consider hardware limitations

See troubleshooting for performance optimization tips.

Still Have Questions?

If your question isn't answered here:

Check the complete documentation
Search existing GitHub issues and discussions
Ask on GitHub Discussions
Review the troubleshooting guide

The U-Probe community is here to help!

Frequently Asked Questions ​

General Questions ​

What is U-Probe? ​

What makes U-Probe different from other probe design tools? ​

Installation and Setup ​

Which Python versions are supported? ​

Do I need to install external tools? ​

Can I use U-Probe on Windows? ​

Can I run U-Probe without installing Python? ​

Configuration ​

How do I find the correct gene names for my organism? ​

Can I design probes for multiple species? ​

How do I design probes for custom genomic regions? ​

What's the difference between "exon", "gene", and "genome" extraction? ​

Probe Design ​

How do I design FISH probes? ​

How do I design PCR primers? ​

Can I use custom sequences in my probes? ​

How do I reference other probes in expressions? ​

Quality Control ​

What quality metrics should I use? ​

How do I set appropriate filter thresholds? ​

Why are all my probes being filtered out? ​

Performance ​

How can I speed up probe design? ​

How much memory does U-Probe need? ​

Can I run U-Probe on a cluster? ​

Output and Results ​

What do the output columns mean? ​

How do I interpret quality metrics? ​

Can I export results in other formats? ​

How do I select the best probes from results? ​

Integration ​

Can I use U-Probe in my Python pipeline? ​

How do I integrate U-Probe with other tools? ​

Can I run U-Probe in Docker? ​

Applications ​

What applications is U-Probe suitable for? ​

Can U-Probe design riboprobes for RNA ISH? ​

Is U-Probe suitable for clinical applications? ​

Getting More Help ​

Where can I find more examples? ​

How do I contribute to U-Probe? ​

Where do I report bugs or request features? ​

How often is U-Probe updated? ​

Can I get commercial support? ​

Troubleshooting Questions ​

Why is my installation failing? ​

Why can't U-Probe find my genes? ​

Why is probe design so slow? ​

Still Have Questions? ​

Frequently Asked Questions

General Questions

What is U-Probe?

What makes U-Probe different from other probe design tools?

Installation and Setup

Which Python versions are supported?

Do I need to install external tools?

Can I use U-Probe on Windows?

Can I run U-Probe without installing Python?

Configuration

How do I find the correct gene names for my organism?

Can I design probes for multiple species?

How do I design probes for custom genomic regions?

What's the difference between "exon", "gene", and "genome" extraction?

Probe Design

How do I design FISH probes?

How do I design PCR primers?

Can I use custom sequences in my probes?

How do I reference other probes in expressions?

Quality Control

What quality metrics should I use?

How do I set appropriate filter thresholds?

Why are all my probes being filtered out?

Performance

How can I speed up probe design?

How much memory does U-Probe need?

Can I run U-Probe on a cluster?

Output and Results

What do the output columns mean?

How do I interpret quality metrics?

Can I export results in other formats?

How do I select the best probes from results?

Integration

Can I use U-Probe in my Python pipeline?

How do I integrate U-Probe with other tools?

Can I run U-Probe in Docker?

Applications

What applications is U-Probe suitable for?

Can U-Probe design riboprobes for RNA ISH?

Is U-Probe suitable for clinical applications?

Getting More Help

Where can I find more examples?

How do I contribute to U-Probe?

Where do I report bugs or request features?

How often is U-Probe updated?

Can I get commercial support?

Troubleshooting Questions

Why is my installation failing?

Why can't U-Probe find my genes?

Why is probe design so slow?

Still Have Questions?