Miroslav Lzicar
Research engineer building AI/ML for science and fun.
CTO & Co-Founder at Deep MedChem. · About Me
Publications
PublicationNov 15, 2025SynthonGPT: Diversity-Oriented Retrieval in Ultra-Large Enumerated Chemical SpacesLžičař, M. · Tech report (2025)A compact synthon-conditioned transformer for navigating makeable chemical space, grounded in vendor enumerations rather than hallucinated SMILES.
PublicationNov 09, 2025CellARC: Measuring Intelligence with Cellular AutomataLžičař, M. · arXiv (2025)Introducing a cellular-automata-native benchmark, leaderboard, and dataset suite that scores reasoning systems on metadata-rich ARC-AGI style episodes.
PublicationOct 23, 2025BitBIRCH-Lean: Chemical Space in the Palm of Your WorkstationLžičař, M. et al. · bioRxiv (2025)Ultra-lean BitBIRCH rewrite that clusters hundred-million scale molecular libraries with on-the-fly compression, optional C++ accelerators, and a batteries-included CLI.
Research Notes
Research NoteJan 30, 2026Winning the 9ADD DYRK1B Virtual Screening ChallengeLžičař, M.; Skach, K.; Korolov, A. · 9th Advanced in silico Drug Design Workshop (2026)Our team at Deep MedChem won the DYRK1B challenge at 9ADD 2026 with a CHEESE ShapeSim + LightGBM workflow. This short note explains what mattered and includes the final presentation.
Research NoteNov 06, 2025Soft Substructure Matching with Differentiable Graph AlignmentA quick note on adapting ISONET’s Gumbel–Sinkhorn edge alignment to molecular substructure search, with links to the source notebook and paper.
Research NoteAug 14, 2025Enumerative vs. Synthon-Based Molecule SearchThere are two distinct technologies to search in databases of molecules. This post explains both with intuitive LEGO illustrations and presents a practical guide on when to use each.
Research NoteAug 08, 2025Do Stereoisomers Cause Leakage? A Closer Look at the LIT-PCBA AuditA recent paper claims an established virtual screening benchmark to be 'fundamentally compromised' due to 'egregious data leakage, rampant duplication, and pervasive analog redundancy', rendering it 'unfit for its intended purpose' and 'calling into question previous results based on its use'. The catch: the authors canonicalized to non-isomeric SMILES, effectively collapsing stereoisomers into duplicates.
Research NoteJun 16, 2025CHEESE Search: A Billion-Scale Vector Molecular SearchThis is my billion-scale vector engine for 3D shape & electrostatic molecular similarity. It turns ligand-based screening into nearest-neighbour lookups—so you can *search, rank, and visualise* huge chemical spaces in seconds.
Research NoteNov 13, 2024What Can Neural Network Embeddings Do That Fingerprints Can’t?Fingerprints have long been the standard for representing molecules, but neural network embeddings are opening doors to new possibilities. This post explores where these AI-driven representations excel, from better capturing molecular similarity with metrics to addressing modalities that traditional fingerprints often miss.
Research NoteAug 31, 2024Abstraction and Reasoning with Vision TransformersI tried, and failed, to get ViTs to work on the ARC challenge.
