TGTGInsighttelegram intelligenceLIVE / telegram public index
Retour aux chaînes
Graph Machine Learning avatar

TGINSIGHT CHAT

Graph Machine Learning

@graphml

Technologies

Everything about graph theory, computer science, machine learning, etc. If you have something worth sharing with the community, reach out @gimmeblues, @chaitjo. Admins: Sergey Ivanov; Michael Galkin; Chaitanya K. Joshi

Abonnés6,750Abonnés actuels de la chaîne
Posts indexés877Nombre de posts indexés
Portée récente67,390Somme des vues récentes
Posts récents

Posts récents

Page 10 sur 74 · 877 posts

Publié 7 oct.

​​Graph ML News (Oct 7th) - FoldFlow, Iambic round, Google’s Graph Mining Library Although ICLR submissions are still not available, October brings some other news! 🌊 Flow Matching is the generative modeling framework of 2023 (and you’ll hear it everywhere in 2024) that is taking the Geometric DL world step by step. While diffusion models can only generate from a Gaussian prior, Flow Matching generative models can take any prior distribution. The seminal paper by Alex Tong et al made huge advancements in the Continuous Normalizing Flows, conditional flow matching, and optimal transport for flow matching (here is the LoGG reading group talk), and we’ll see a good bunch of generative models for molecules and proteins based on this framework. A few days ago, the DreamFold team from Mila led by Joey Bose and Tara Akhound-Sadegh together with Michael Bronstein (and with Alex Tong) released FoldFlow, an SE(3) equivariant flow matching model for protein backbone generation. Perhaps the coolest result is in the attached figure - whereas AlphaFold 2 can only discover one energy state of the protein structure, FoldFlow captures all modes of the distribution which increases diversity of generated samples. Hope to hear more from folks at DreamFold in future! 🧬 Iambic Therapeutics (former Entos) raised $100M Series B to advance their drug discovery platform. Iambic identified 2 drug candidates (apparently preliminary trials look ok) and is active in the academic environment, ie, the team created OrbNet and recent NeuralPlexer, an equivariant diffusion model for protein-ligand docking. ⛏️ Google open-sourced the Graph Mining library in C++ with scalable and parallel graph clustering algorithms including the recent ParHAC from NeurIPS’22 that processed a 154 billion edges graph in 3 hours. No graph is too large for Google. 🍧 Floris Geerts (University of Antwerp) gave a Richard M. Karp distinguished lecture at the Simons institute on “The Power of Graph Learning” focusing on theoretical aspects of GNNs expressiveness, and explaining the idea of Graph Embedding Language (GEL) that bridges a gap between GNNs and databases. While the GEL paper is in the works, there is a nice slide deck about it. Weekend reading: SE(3)-Stochastic Flow Matching for Protein Backbone Generation - FoldFlow Equivariant flow matching by Leon Klein, Andreas Krämer, Frank Noé - to complete the equivariant flow matching picture SaProt: Protein Language Modeling with Structure-aware Vocabulary by Su et al - pretty massive gains over ESM-2 in the protein structure awareness, 650M params trained on 64 A100 for 3 months. The code is already available on GitHub Cooperative Graph Neural Networks by Ben Finkelshtein et al - a new look at the message passing procedure where nodes can “decide” whether to propagate neighbors messages, send own messages, or remain silent.

5,130 views

Publié 3 oct.

GraphML News (Oct 3rd) Well, no big news from the past weekend since ICLR’24 submissions are still not available after the main deadline 🙁 At least we can read the abstracts of all accepted NeurIPS’23 papers here. A brief search indicates that the amount of papers with “diffusion” (192) is as large as “graph” papers (202). Meanwhile, VantAI launches a monthly lecture series on Generative AI in Drug Discovery hosted by Michael Bronstein and Bruno Correia. The inaugural meeting will be held this Friday, October 6, at 11 am ET / 5 pm CET. Free to join using the links provided. A few fresh software releases: PyDGN got updated to 1.5, and industry-grade GraphStorm released v0.2 featuring better support for distributed training on GPUs. Paper reading: Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems (NeurIPS’23) by Google on featurization strategies for ML in search, ads, and recsys. Limits, approximation and size transferability for GNNs on sparse graphs via graphops (NeurIPS’23) by Thien Le and Stefanie Jegelka on size generalization in GNNs. Sheaf Hypergraph Networks (NeurIPS’23) by Iulia Duta et al (math alert 🤯) On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters by Matthias Lanzinger and Pablo Barceló

5,180 views

Publié 23 sept.

GraphML News (Sep 23rd) - Stanford Graph Learning Workshop, AlphaMisuse, PEFT for ESM NeurIPS decisions for both tracks are out - congrats to those who made it in and encouragements to those who did not, hopefully the next iteration would get better! Our team got 2 papers accepted including A*Net - a scalable knowledge graph reasoning method that can be used, eg, for improving factual correctness of language models (demo is on github). Next weeks we can expect more accepted papers to be publicly available, so we’ll keep you updated. Don’t forget about the NeurIPS graph workshops many of which extended their deadlines to early October! Stanford Graph Learning Workshop was officially announced and will take place physically on Oct 24th. This time the organizers published a call for contributed talks from the academic and industry tracks. I will try to be there, ping me if you want to chat. Google DeepMind announced AlphaMisuse, a model for categorizing “missense” genetic mutations based on AlphaFold. AlphaMisuse predicted labels for ~60M possible missense mutations whereas humans covered at most ~700K. Unfortunately, the authors say the model weights won’t be released so let’s hope for re-implementations in open source ecosystems. If you have been living under the rock, parameter-efficient fine-tuning (PEFT) techniques took the world of LLMs by the storm and it’s pretty much everywhere now. Amelie Schreiber wrote a great blogpost on applying LoRA to the ESM-2 family of protein LMs so even the beefiest of ESMs (still pretty small compared to Llama’s though) can be now fine-tuned on commodity GPUs. To learn more about PEFT, check out this fresh survey by Vladislav Lialin et al. Some freshly accepted NeurIPS papers for the weekend reading: Implicit Transfer Operator Learning: Multiple Time-Resolution Surrogates for Molecular Dynamics SE(3) Equivariant Augmented Coupling Flows When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability Fine-grained Expressivity of Graph Neural Networks Next week ICLR’24 submissions become available, so oh boy we’ll have the weekend reading 👀

6,020 views

Publié 16 sept.

Graph ML News (Sep 16th) - Breakthrough Prize, OpenCatalyst cases, Illustrated Cats, EvoDiff 🏆 The Breakthrough Prize winners aka “Oscars of Science” were announced earlier this week (Ig Nobel Prizes were announced as well but that’s a story for another fun time) and they do have a nice connection to Geometric DL! The Math prize went to Simon Brendle (Columbia) for “transformative contributions to differential geometry, including sharp geometric inequalities, many results on Ricci flow and mean curvature flow and the Lawson conjecture on minimal tori in the 3-sphere.” Ricci flows played a key role in understanding theoretical capabilities of GNNs in the seminal paper by Topping et al that received ICLR 2022 Outstanding Award and spun off more research of differential geometry and GNNs. Perfect time to jump on the Ricci flowwagon (pun intended). Do check other winners, their research is very cool as well. 🧪 The OpenCatalyst team published two case studies how the OCP demo helped in the scientific research of catalyst discovery: for the nitrogen reduction reaction (NRR) and for hydrogen fuel cells. OpenCatalyst turns into smth like AlphaFold but for materials science and chemistry. 😼 Finally, check out the Category Theory Illustrated book by Boris Marinov - this perhaps the most visual resource to understand the basics of Category Theory. As of now, 6 chapters are ready — on Sets, Categories, Monoids, Order, Logic, and Functors. Don’t forget about Cats4AI to learn more about Category Theory applied to ML and GNNs. 🧬 MSR AI4Science released EvoDiff - a massive work on the discrete diffusion generative model for conditional generation of protein sequences. EvoDiff was designed for sequences and MSAs and ships in two sizes — 38M and 640M params so it would fit on a variety of GPUs. Some weekend reading: Protein generation with evolutionary diffusion: sequence is all you need - introducing EvoDiff Graph Neural Networks Use Graphs When They Shouldn't by Bechler-Speicher et al. - one more evidence for graph rewiring Is Solving Graph Neural Tangent Kernel Equivalent to Training Graph Neural Network? by Qin et al - for all you hardcore theory lovers on the channel

5,720 views

Publié 9 sept.

Graph ML News (Sep 9th) The upcoming ICLR deadline and LOG reviewing period seem to keep the community busy and reduce the amount of news content this week. We’ll compensate for that the day ICLR submissions are on OpenReview 😉 The local LoG meetup in Trento will take place on November 27th-30th (together with the main conference held online and fully remotely). There is a handful of local meetups already (if I remember correctly, other locations include UK, Germany, Canada, and a few in the US). Actually, it might be a good time for the LOG organizers to publish the confirmed ones. The GAIN workshop on explainability and applicability of GNNs took place this week (Sept 6-8th), waiting for the recordings! Weekend reading: RetroBridge: Modeling Retrosynthesis with Markov Bridges by Ilia Igashov, Arne Schneuing, Marwin Segler, Michael Bronstein, Bruno Correia — a new generative framework for template-free retrosynthesis with some math traces of discrete diffusion Where Did the Gap Go? Reassessing the Long-Range Graph Benchmark by Jan Tönshoff, Martin Ritzert, Eran Rosenbluth, Martin Grohe — turns out some hyperparameters tinkering can boost baseline performance on LRGB! Using Multiple Vector Channels Improves E(n)-Equivariant Graph Neural Networks by Levy, Kaba, et al - a simple and inexpensive multi-channel trick to boost EGNNs A few theory papers: Representing Edge Flows on Graphs via Sparse Cell Complexes by Josef Hoppe, Michael T. Schaub Unifying over-smoothing and over-squashing in graph neural networks: A physics informed approach and beyond by Shao et al.

5,830 views

Publié 2 sept.

Graph ML News (Sep 2nd) - TpuGraphs Kaggle competition, EvolutionaryScale Google launched a proper graph learning Kaggle competition ”Fast or Slow?” with a $50k prize pool. The challenge is based off a recently released TpuGraphs dataset — given a computational graph (as a DAG), predict its runtime given a certain input configuration (on node- or graph-level) and get the fastest config. Practically, it can be framed as a regression or ranking problem. TpuGraphs is pretty large: 7k nodes / 31M configuration pairs for the layout collection, and 40 nodes / 13M pairs for the tile collection. Baselines include GCN and GraphSAGE, but we can probably expect Kaggle grandmasters to come up with creative gradient boosting and decision trees techniques as well 😉 So XGBoost or GNNs? The challenge is open until Nov 17th. A few weeks ago we found out that Meta disbanded the protein team working on ESM, ESMFold, and a handful of other projects. Now we know that the ESM team formed EvolutionaryScale and raised about $40M of funding promising new versions of ESM every year. Great news for thousands of protein projects using ESM models! Weekend reading: TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs Exploring "dark matter" protein folds using deep learning feat. Andreas Loukas, Michael Bronstein, and Bruno Correia

6,570 views

Publié 26 août

Graph ML News (Aug 25th) The autumn edition of the Molecular ML Conference (MoML) going to take place on Nov 8th at MIT. MoML is a premier venue for bringing together graph learning and life sciences crowd including computation biology, drug discovery, computational chemistry, molecular simulation, and many more. Submit a poster until Oct 13th! Not an official announcement, but there are rumors that the Stanford Graph Learning Seminar will return on Oct 11th as well 😉 Expect a flurry of ICLR submissions in the next weeks before the deadline, but meanwhile the weekend reading is: UGSL: A Unified Framework for Benchmarking Graph Structure Learning by Google Research feat. Bahare Fatemi, Anton Tsitsulin and Bryan Perozzi Simulate Time-integrated Coarse-grained Molecular Dynamics with Multi-scale Graph Networks feat. Xiang Fu, Tommi Jaakkola Approximately Equivariant Graph Networks by Teresa Huang, Ron Levie, and Soledad Villar Will More Expressive Graph Neural Networks do Better on Generative Tasks? (spoiler alert: nopes) feat. Pietro Liò The Expressive Power of Graph Neural Networks: A Survey

5,940 views

Publié 19 août

Graph ML News (Aug 18th) A new blog post Designing Deep Networks to Process Other Deep Networks by Haggai Maron, Ethan Fetaya, Aviv Navon, Aviv Shamsian, Idan Achituve, and Gal Chechik applies the concepts of symmetry and invariances (common tools in Geometric DL) to the task of predicting model weights. Working in the Deep Weight Space (all parameters of neural networks), we want neural architectures to be invariant to permutations of neurons because mathematically any permutation should still encode the same function. Two papers appeared almost simultaneously, PoseBusters by Buttenschoen et al and PoseCheck by Harris et al, providing a critical look on modern generative models (often diffusion-based) for protein-ligand docking and structure-based drug design. PoseBusters finds that generative models often have problems with physical plausibility of the generated outputs while PoseCheck finds many nonphysical features in generated molecules and poses. Huge opportunities for improving equivariant diffusion models! The Simons Institute for the Theory of Computing held a workshop on large language models and transformers. It was not very much into graph learning but still featured a handful of talks on core topics that will be in graph ML sooner or later. Featuring talks by Chris Manning, Yejin Choi, Ilya Sutskever, Sasha Rush, and other famous researchers — the playlist with recorded talks is already on YouTube 👀 Weekend reading: Score-based Enhanced Sampling for Protein Molecular Dynamics feat. Jian Tang - a score-based model for approximating MD calculations.

5,580 views

Publié 12 août

Graph ML News (Aug 12th) - ESM Disbandment, KDD’23, LoG’23 😮 The ESM team at Meta AI has been disbanded to a large surprise of the community - the suite of ESM protein language models (ESM-1, ESM-2) and ESMFold became very popular in the protein representation and generation, and things looked promising upon the release of the ESM Metagenomic Atlas with 600M+ protein structures. Some rumors say the team would continue working on the ESM stack at another place, so we’ll keep an eye on their next steps. KDD’23 has just finished in Long Beach - perhaps it is the most graph-packed data mining conference featuring 3 workshops and 10 tutorials on Graph ML topics. The proceedings are already available and full of graph papers. I attended the Graph Learning Benchmarks workshop last Sunday to participate in the panel discussion, met old and new friends, and enjoyed a less crowded venue than ICML (still socially drained after Hawaii though). The submission deadline for the best Graph ML conference Learning on Graph 2023 (LoG) is Aug 21st (AoE) and approaching — consider submitting if you didn’t like savage NeurIPS strong reject reviews 👺. For me, the LoG reviewing (both as an author and reviewer) and conference experience was the best in 2022, highly recommend! Weekend reading: AbDiffuser: Full-Atom Generation of In-Vitro Functioning Antibodies feat. Kyunghyun Cho and Andreas Loukas — a continuous (atom coordinates) and discrete (residue types) diffusion model for generating antibodies. “Laboratory experiments confirm that all 16 HER2 antibodies discovered were expressed at high levels and that 57.1% of selected designs were tight binders” 👀. Augmenting Recurrent Graph Neural Networks with a Cache feat. Nesreen Ahmed — introduces CacheGNNs with memory, sets a new SOTA (with a significant margin) on the Peptides-struct graph regression problem of the Long-Range Graph Benchmark. VQGraph: Graph Vector-Quantization for Bridging GNNs and MLPs feat. Jure Leskovec Diffusion Denoised Smoothing for Certified and Adversarial Robust Out-Of-Distribution Detection feat. Stephan Günnemann

5,320 views

Publié 6 août

Graph Machine Learning @ ICML 2023 Just finished a new Medium post summarizing Graph ML papers seen at ICML 2023 with some additional photos from Hawaii to make the text less boring 😉 What you can find inside: - Graph Transformers: Sparser, Faster, and Directed - Theory: VC dimension of GNNs, deep dive in over-squashing - New GNN architectures: delays and half-hops - Generative Models - Stable Diffusion for Molecules, Discrete diffusion - Geometric Learning: Geometric WL, Clifford Algebras - Molecules: 2D-3D pretraining, Uncertainty Estimation in MD - Materials & Proteins: CLIP for proteins, Ewald Message Passing, Equivariant Augmentations - Cool Applications: Algorithmic reasoning, Inductive KG completion, GNNs for mass spectra

5,530 views

Publié 30 juil.

Graph ML News (July 30th) - ICML and Open Catalyst Demo The ICML week has finally passed with yesterdays’ workshops. Meeting the graph learning community was a blast and I am looking forward seeing you guys and gals at NeurIPS or already in Vienna at the next ICLR and ICML. The review post of most interesting graph papers at ICML is on the way 😉 Meanwhile, Meta AI and CMU released the Open Catalyst Demo - a website where you can play around with relaxations (DFT approximations) of 11.5k catalyst materials on 86 adsorbates in 100 different configurations each (making it up to 100M combinations). The demo is powered by SOTA geometric models GemNet-OC and Equiformer-V2. Hopefully the demo will grow up to something as large and popular as AlphaFold DB (but for materials)! The GAIN community in Germany hosts the Workshop on Explainability and Applicability of Graph Neural Networks to be held in Kassel on September 6-8th. The workshop will feature invited talks by Christopher Morris, Soledad Villar, Petar Veličković, and Emanuele Rossi.

5,210 views

Publié 22 juil.

Graph ML News (July 22nd) - ICML’23, AI for Science survey ICML time! Michael will be representing the Graph ML channel in the infamous, 3-of-a-kind, limited edition t-shirt, drop him a line if you’d like to chat. Big labs started to announce their presence and accepted papers (not just graph papers though), eg, Google DeepMind, Meta AI, Amazon, Microsoft, Apple. If you didn’t make it to ICML this year, consider a fresh selection of the weekend reading: 📚Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems by Xuan Zhang and 60+ famous authors is a massive 260-page survey on geometric models in scientific applications spanning molecules, proteins, quantum mechanics, PDEs, and materials discovery. Contextualizing Protein Representations Using Deep Learning on Protein Networks and Single-Cell Data by Michelle M Li et al from Marinka Zitnik’s lab at Harvard. Quote: “We introduce PINNACLE, a flexible geometric deep learning approach that is trained on contextualized protein interaction networks to generate context-PINNACLE protein representations. Leveraging a human multi-organ single-cell transcriptomic atlas, PINNACLE provides 394,760 protein representations split across 156 cell type contexts from 24 tissues and organs.”

5,360 views
12•••5•••89101112•••15•••20•••25•••30•••35•••40•••45•••50•••55•••60•••65•••70•••7374