Miroslav Lzicar

Research engineer building AI/ML for science and fun.

CTO & Co-Founder at Deep MedChem. · About Me

Publications
  • Thumbnail for SynthonBench: Benchmarking Sample-Efficient Optimization in Combinatorial Chemical Spaces
    PublicationJun 10, 2026
    SynthonBench: Benchmarking Sample-Efficient Optimization in Combinatorial Chemical Spaces
    Accepted to ICML 2026 GenBio Workshop
    Synthesis-on-demand libraries expose vast chemical spaces through reaction templates and compatible synthons, so the key question is which candidates to evaluate under a small oracle budget. SynthonBench introduces reaction-native, budgeted search tasks where optimizers propose valid reaction/synthon tuples and are scored with top-k recall, enrichment, regret, and Pareto metrics. Accepted to the ICML 2026 GenBio Workshop.
    arXiv soon
  • Thumbnail for Factorized Search and Cartography of Synthon-Based Chemical Spaces
    PublicationJun 09, 2026
    Factorized Search and Cartography of Synthon-Based Chemical Spaces
    Accepted to ICML 2026 GenBio Workshop
    Make-on-demand design spaces are increasingly defined by reactions and synthons rather than enumerated molecules, which makes search, audit, and visualization a grounding problem. This work presents SynthonOR for query-conditioned factorized retrieval and SynthonMap for route-aware sampling, embedding, and product-mass cartography. Accepted to the ICML 2026 GenBio Workshop.
    arXiv soon
  • Thumbnail for Auditing Neural Thickets with Low-Rank Routes
    PublicationJun 09, 2026
    Auditing Neural Thickets with Low-Rank Routes
    Accepted to ICML 2026 WSS Workshop
    Neural thickets suggest that many task experts may lie dense around pretrained weights, but density does not mean those experts are easy to discover. This audit tests whether nearby experts can be found through tiny learned low-rank routes under deployable selection and matched total query budgets, producing a mixed but useful probe of selector-sensitive neighborhoods. Accepted to the ICML 2026 Weight-Space Symmetries Workshop.
    arXiv soon
  • Thumbnail for SynthonOR: Simple Open-Source Synthon Similarity Search
    PublicationApr 24, 2026
    SynthonOR: Simple Open-Source Synthon Similarity Search
    Lžičař, M. · Open-source release (2026)
    A lightweight Python package for reaction-grounded synthon retrieval that stays intentionally simple: generic RDKit fingerprints combined with a binary OR.
  • Thumbnail for SynthonGPT: Diversity-Oriented Retrieval in Ultra-Large Enumerated Chemical Spaces
    PublicationNov 15, 2025
    SynthonGPT: Diversity-Oriented Retrieval in Ultra-Large Enumerated Chemical Spaces
    Lžičař, M. · Tech report (2025)
    A compact synthon-conditioned transformer for navigating makeable chemical space, grounded in vendor enumerations rather than hallucinated SMILES.
  • Thumbnail for CellARC: Measuring Intelligence with Cellular Automata
    PublicationNov 09, 2025
    CellARC: Measuring Intelligence with Cellular Automata
    Lžičař, M. · arXiv (2025)
    Introducing a cellular-automata-native benchmark, leaderboard, and dataset suite that scores reasoning systems on metadata-rich ARC-AGI style episodes.
  • Thumbnail for BitBIRCH-Lean: Chemical Space in the Palm of Your Workstation
    PublicationOct 23, 2025
    BitBIRCH-Lean: Chemical Space in the Palm of Your Workstation
    Lžičař, M. et al. · bioRxiv (2025)
    Ultra-lean BitBIRCH rewrite that clusters hundred-million scale molecular libraries with on-the-fly compression, optional C++ accelerators, and a batteries-included CLI.
  • Thumbnail for CHEESE: 3D Shape and Electrostatic Virtual Screening in a Vector Space
    PublicationNov 05, 2024
    CHEESE: 3D Shape and Electrostatic Virtual Screening in a Vector Space
    Lžičař, M. and Gamouh, H. · ChemRxiv (2024)
    A novel approach to virtual screening using 3D shape and electrostatics in a vector space.
Research Notes
  • Thumbnail for Electrostatic Embeddings Found NS1: A Molecule That Made It Into an Animal Model
    Research NoteJun 10, 2026
    Electrostatic Embeddings Found NS1: A Molecule That Made It Into an Animal Model
    A CHEESE case study where 3D electrostatic similarity prioritized NS1, a tetrazolic neurosteroid candidate, before synthesis. NS1 later showed strong activity and advanced to in vivo validation in an osteoarthritis pain model.
  • Thumbnail for Winning the 9ADD DYRK1B Virtual Screening Challenge
    Research NoteJan 30, 2026
    Winning the 9ADD DYRK1B Virtual Screening Challenge
    Lžičař, M.; Skach, K.; Korolov, A. · 9th Advanced in silico Drug Design Workshop (2026)
    Our team at Deep MedChem won the DYRK1B challenge at 9ADD 2026 with a CHEESE ShapeSim + LightGBM workflow. This short note explains what mattered and includes the final presentation.
  • Thumbnail for Soft Substructure Matching with Differentiable Graph Alignment
    Research NoteNov 06, 2025
    Soft Substructure Matching with Differentiable Graph Alignment
    A quick note on adapting ISONET’s Gumbel–Sinkhorn edge alignment to molecular substructure search, with links to the source notebook and paper.
  • Thumbnail for Enumerative vs. Synthon-Based Molecule Search
    Research NoteAug 14, 2025
    Enumerative vs. Synthon-Based Molecule Search
    There are two distinct technologies to search in databases of molecules. This post explains both with intuitive LEGO illustrations and presents a practical guide on when to use each.
  • Thumbnail for Do Stereoisomers Cause Leakage? A Closer Look at the LIT-PCBA Audit
    Research NoteAug 08, 2025
    Do Stereoisomers Cause Leakage? A Closer Look at the LIT-PCBA Audit
    A recent paper claims an established virtual screening benchmark to be 'fundamentally compromised' due to 'egregious data leakage, rampant duplication, and pervasive analog redundancy', rendering it 'unfit for its intended purpose' and 'calling into question previous results based on its use'. The catch: the authors canonicalized to non-isomeric SMILES, effectively collapsing stereoisomers into duplicates.
  • Thumbnail for CHEESE Search: A Billion-Scale Vector Molecular Search
    Research NoteJun 16, 2025
    CHEESE Search: A Billion-Scale Vector Molecular Search
    This is my billion-scale vector engine for 3D shape & electrostatic molecular similarity. It turns ligand-based screening into nearest-neighbour lookups—so you can *search, rank, and visualise* huge chemical spaces in seconds.
  • Thumbnail for What Can Neural Network Embeddings Do That Fingerprints Can’t?
    Research NoteNov 13, 2024
    What Can Neural Network Embeddings Do That Fingerprints Can’t?
    Fingerprints have long been the standard for representing molecules, but neural network embeddings are opening doors to new possibilities. This post explores where these AI-driven representations excel, from better capturing molecular similarity with metrics to addressing modalities that traditional fingerprints often miss.
  • Thumbnail for Abstraction and Reasoning with Vision Transformers
    Research NoteAug 31, 2024
    Abstraction and Reasoning with Vision Transformers
    I tried, and failed, to get ViTs to work on the ARC challenge.