Robert Haas

Project Owner

Augmenting BMKGs With Cheminformatics And CADD

Expert Rating

n/a

Type SingularityNET RFP
Funding Request n/a
RFP Guidelines Advanced knowledge graph tooling for AGI systems

Details locked. Check back later to view.
Details locked.Check back later to view.
Details locked.Check back later to view.

Overview

Biomedical knowledge graphs (BMKG) contain chemical compounds such as drugs, toxins, metabolites, cofactors or signaling molecules. These entities and some of their relations can be richly augmented with qualitative & quantitative properties by methods from cheminformatics, computer-aided drug design (CADD) and related fields. This enables numerical queries and many analyses such as filtering, clustering, embedding, similarity/outlier detection, QSAR modeling, ML, etc. The aim of this project is to make existing but scattered methods available in a Python package with a unified functional API, expose it to OpenCog Hyperon, and apply it in a PoC study to annotate and analyze Hetionet in MORK.

Project Tags:
Community and Collaboration

RFP Guidelines

Advanced knowledge graph tooling for AGI systems

Internal Proposal Review

Type SingularityNET RFP
Total RFP Funding $350,000 USD
Proposals 39
Awarded Projects n/a

SingularityNET

Apr. 16, 2025

This RFP seeks the development of advanced tools and techniques for interfacing with, refining, and evaluating knowledge graphs that support reasoning in AGI systems. Projects may target any part of the graph lifecycle — from extraction to refinement to benchmarking — and should optionally support symbolic reasoning within the OpenCog Hyperon framework, including compatibility with the MeTTa language and MORK knowledge graph. Bids are expected to range from $10,000 - $200,000.

Proposal Description

Proposal Details Locked…

In order to protect this proposal from being copied, all details are hidden until the end of the submission period. Please come back later to see all details.

Total Milestones
5
Total Budget
$70,000_USD
Last Updated
27 May 2025

Milestone 1 - Kick-off and research

Description

1) Set up and sign the contract. 2) Extend the preliminary research done as prepartion for this proposal. This means a broad literature and code review of existing functionality in cheminformatics and computer-aided drug discovery today will be performed in order to find out what implementations are actively maintained, backed by publications, deliver reliable results, and can be used in combination in a shared conda environment.

Deliverables

The results of the broad literature and code review are provided in form of a GitHub repository rather than a PDF report so that other researchers can find and extend it in an easy way. If determined suitable, it may adhere to the style of an "awesome list" repository to make it easily findable and recognizable.

Budget

$10,000 USD

Success Criterion

1) Contract is signed. 2) GitHub repository is set up and contains the described content.

Milestone 2 - Design

Description

Decide which 5 to 7 external projects are going to be covered and what structure the unified API is going to take. The precise number of projects will depend on how much functionality each of them provides and how complex it is to abstract it into a functional interface that maximizes mutual compatibility. From the current point of view, good candidates seem to be OpenBabel, RDKit, Indigo and PaDeL-Descriptor, which cover a wide range of functionality, e.g. format conversions, descriptor and fingerprint calculations, 3D structure generation, 2D and 3D visualization, tautomer enumeration, etc. Reasonable additions from the perspective of broad methodological coverage could be a dedicated 3D conformer generator like Balloon, a molecular docking program like AutoDock Vina, and perhaps a quantum chemical toolkit like Psi4 for slower but more accurate geometry and electronic structure prediction that could be used as basis for molecular dynamics simulation.

Deliverables

The results of the project selection and API design will be a Python package with a scaffold for the covered toolkits and a few sample functions already implemented to ensure the outline works as intended.

Budget

$10,000 USD

Success Criterion

1) A GitHub repository is set up and contains the partial Python package. It reflects the chosen design for the unified API in form of a hierarchy of subpackages (=folders), modules (=files) and functions (=text in the files).

Milestone 3 - Implementation

Description

Fully implement the Python package that provides a unified API. The aim is to cover a large portion of the methods provided in the chosen external projects, though some very specific methods may not be of broad interest and therefore omitted.

Deliverables

Completed Python package, including a test suite with high coverage of the codebase, and everything required for distributing it as easily installable package on a suitable repository.

Budget

$23,000 USD

Success Criterion

1) The GitHub repository contains the completed Python package. The scaffold from the previous milestone is now filled with a lot of implementation details (=text in the files has considerably expanded). There is also a test folder containing code for automated testing of the implementation. Documentation will follow in the last milestone. 2) A distributed Python package is set up and available either on PyPI or on Anaconda.

Milestone 4 - Application

Description

Apply the Python package on a proof-of-concept case study. A reasonably sized biomedical knowledge graph will be ported to MeTTa and then augmented with various functions provided in the package, either by manual or automatic registering them as grounded atoms in OpenCog Hyperon. Ideally MORK will be used as backend if the project is mature enough at this point. The goal is not only to annotate the BMKG but also to perform interesting queries and analyses on it, e.g. basic filtering up to embedding and QSAR modeling or supervised ML.

Deliverables

Proof-of-concept case study that applies the package on a BMKG such as Hetionet and performs some downstream analyses.

Budget

$15,000 USD

Success Criterion

1) Code and documentation to reproduce the case study and apply similar calculations to other BMKGs with small molecules in them is provided at a suitable location.

Milestone 5 - Documentation

Description

Generate a technical documentation website for the Python package and a summary PDF report for the entire project.

Deliverables

Code documentation so that external developers can use it without further help. PDF report so the project and its results can be understood by anyone interested.

Budget

$12,000 USD

Success Criterion

1) GitHub repository and website that documents the Python package are set up. 2) PDF report that summarizes the project is provided at a suitable location.

Join the Discussion (0)

Expert Ratings

Reviews & Ratings

No Reviews Avaliable

Check back later by refreshing the page.

The weighted average of the 4 perspectives Overall

0.0
Each RFP defines a maximum allowed budget, but teams can differentiate their proposal by offering a solution with a lower budget or a wider scope.Value for money

0.0
This rating indicates compliance to 'Must haves' but also adaptation of 'Nice to haves' and Non-functional requirements defined in the RFP.Compliance with RFP requirements

0.0
RFPs will offer varying degrees of freedom. This rating indicates the quality of the team's specific solution ideas, the provided details, and the reviewer's confidence in the team's ability to execute.Solution details and team expertise

0.0

Review Headline

0 /50 chars

Review Summary

0 /5000 chars

The weighted average of the 4 perspectives Overall

0.0
Each RFP defines a maximum allowed budget, but teams can differentiate their proposal by offering a solution with a lower budget or a wider scope.Value for money

0.0
This rating indicates compliance to 'Must haves' but also adaptation of 'Nice to haves' and Non-functional requirements defined in the RFP.Compliance with RFP requirements

0.0
RFPs will offer varying degrees of freedom. This rating indicates the quality of the team's specific solution ideas, the provided details, and the reviewer's confidence in the team's ability to execute.Solution details and team expertise

0.0

Review Headline

0 /50 chars

0 /5000 chars

Warning: Adding final group rating for this project will prevent expert users from adding new or editing existing reviews

Reviews and Ratings in Deep Funding are structured in 4 categories. This will ensure that the reviewer takes all these perspectives into account in their assessment and it will make it easier to compare different projects on their strengths and weaknesses. Overall (Primary) This is an average of the 4 perspectives. At the start of this new process, we are assigning an equal weight to all categories, but over time we might change this and make some categories more important than others in the overall score. (This may even be done retroactively). Feasibility (secondary) This represents the user\'s assessment of whether the proposed project is theoretically possible and if it is deemed feasible. E.g. A proposal for nuclear fission might be theoretically possible, but it doesn’t look very feasible in the context of Deep Funding. Viability (secondary) This category is somewhat similar to Feasibility, but it interprets the feasibility against factors such as the size and experience of the team, the budget requested, and the estimated timelines. We could frame this as: “What is your level of confidence that this team will be able to complete this project and its milestones in a reasonable time, and successfully deploy it?” Examples:

A proposal that promises the development of a personal assistant that outperforms existing solutions might be feasible, but if there is no AI expertise in the team the viability rating might be low.
A proposal that promises a new Carbon Emission Compensation scheme might be technically feasible, but the viability could be estimated low due to challenges around market penetration and widespread adoption.

Desirability (secondary) Even if the project team succeeds in creating a product, there is the question of market fit. Is this a project that fulfills an actual need? Is there a lot of competition already? Are the USPs of the project sufficient to make a difference? Example:

Creating a translation service from, say Spanish to English might be possible, but it\'s questionable if such a service would be able to get a significant share of the market

Usefulness (secondary) This is a crucial category that aligns with the main goal of the Deep Funding program. The question to be asked here is: “To what extent will this proposal help to grow the Decentralized AI Platform?” For proposals that develop or utilize an AI service on the platform, the question could be “How many API calls do we expect it to generate” (and how important / high-valued are these calls?). For a marketing proposal, the question could be “How large and well-aligned is the target audience?” Another question is related to how the budget is spent. Are the funds mainly used for value creation for the platform or on other things? Examples:

A metaverse project that spends 95% of its budget on the development of the game and only 5 % on the development of an AI service for the platform might expect a low ‘usefulness’ rating here.

A marketing proposal that creates t-shirts for a local high school, would get a lower ‘usefulness’ rating than a marketing proposal that has a viable plan for targeting highly esteemed universities in a scaleable way.
An AI service that is fully dedicated to a single product, does not take advantage of the purpose of the platform. When the same service would be offered and useful for other parties, this should increase the ‘usefulness’ rating.

Robert Haas

Project Owner

Project Manager (Proposal), Research Lead (Review of existing methods), Software Designer & Developer (Package implementation), Technical Writer (Documentation and Report)

View Profile