simuliinc
Project Owner
Simuli Inc, led by neuroscience expert Dr. St.Clair and semiconductor veteran Binoy Syed, will execute this 6-month MeTTa corpus development project, leveraging its AI and LLM expertise.
This proposal outlines the creation of a 20,000-pair MeTTa language corpus to enable training of an AI coding assistant. Instruction-output pairs are generated through a combination of data collection, processing, and synthesis, then rigorously validated by both automated checks and human review. The corpus will cover six key areas, including arithmetic, functional programming, and AGI-specific tasks. The $35,000, 6-month project delivers the corpus, validation tools, documentation, and a roadmap for future updates. A unique aspect is that the extraction/generation model can later be used to validate both the resulting MeTTa LLM and a pedagogical approach.
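As a minimal sketch of how one instruction-output pair might be stored, assuming a JSON Lines layout: the field names (`instruction`, `output`, `category`), the six category labels (inferred from the areas named in the milestones), and the sample MeTTa snippet are illustrative assumptions, not the project's actual schema.

```python
import json
from dataclasses import dataclass, asdict

# Hypothetical category labels, inferred from the areas named in the milestones.
CATEGORIES = {
    "arithmetic",
    "functional_programming",
    "symbolic_reasoning",
    "graph_operations",
    "agi_tasks",
    "probabilistic_models",
}


@dataclass(frozen=True)
class Pair:
    """One corpus record: a natural-language instruction and its MeTTa output."""
    instruction: str
    output: str
    category: str

    def __post_init__(self):
        # Reject records that could not be used for training.
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category}")
        if not self.instruction.strip() or not self.output.strip():
            raise ValueError("instruction and output must be non-empty")


def to_jsonl(pairs):
    """Serialize pairs as one JSON object per line."""
    return "\n".join(json.dumps(asdict(p), ensure_ascii=False) for p in pairs)


def from_jsonl(text):
    """Parse JSON Lines text back into validated Pair records."""
    return [Pair(**json.loads(line)) for line in text.splitlines() if line.strip()]


# Illustrative record only; the MeTTa snippet is an assumed example.
example = Pair("Add 2 and 3 in MeTTa.", "!(+ 2 3)", "arithmetic")
```

Keeping one record per line makes it straightforward to append each milestone's batch to the same file and to stream the corpus during fine-tuning.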
Develop a MeTTa language corpus to enable the training or fine-tuning of an LLM and/or LoRAs aimed at supporting developers by providing a natural language coding assistant for the MeTTa language.
Generate and validate first batch of instruction-output pairs covering arithmetic operations and functional programming paradigms in MeTTa. Establish initial validation framework.
6,000-7,000 validated instruction-output pairs; initial extraction/generation model; first version of the validation tooling; documentation of the processes used
$10,000 USD
95% pass rate on automated validation checks; human expert validation of a random 10% sample; successful execution of all code samples; documentation peer reviewed by 2 team members
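The two quantitative criteria above (an automated pass rate and a random 10% sample for human review) could be checked roughly as follows. This is a sketch under assumptions: the function names are hypothetical, automated check results are taken to be one boolean per pair, and the fixed seed is only there to make the review sample reproducible.

```python
import random


def pass_rate(results):
    """Fraction of pairs whose automated check passed (results: list of bools)."""
    if not results:
        raise ValueError("no results to score")
    return sum(results) / len(results)


def meets_threshold(results, threshold=0.95):
    """True if the automated pass rate reaches the milestone threshold."""
    return pass_rate(results) >= threshold


def review_sample(pair_ids, fraction=0.10, seed=0):
    """Draw a reproducible random sample of pair IDs for human expert review."""
    ids = list(pair_ids)
    # At least one item when the batch is non-empty, never more than the batch.
    k = min(len(ids), max(1, round(len(ids) * fraction)))
    return random.Random(seed).sample(ids, k)
```

For a batch of 6,500 pairs, `review_sample(range(6500))` selects 650 IDs; passing a higher `threshold` (0.97, 0.99) covers the later milestones.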
Develop and validate pairs focused on symbolic reasoning and graph operations. Enhance validation framework based on learnings.
Additional 6,000-7,000 validated pairs; improved validation framework; updated extraction/generation model; integration tests for new pairs
$10,000 USD
97% pass rate on automated validation; cross-validation by separate model implementations; all graph operations verified with test cases; zero conflicts with existing corpus
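The proposal does not define what counts as a conflict with the existing corpus. One plausible reading, assumed here purely for illustration, is a new pair whose instruction matches an existing one (ignoring case and whitespace) but whose output differs. The helper names below are hypothetical.

```python
def normalize(text):
    """Lowercase and collapse whitespace so trivially different prompts compare equal."""
    return " ".join(text.lower().split())


def find_conflicts(existing, new):
    """Return (instruction, existing_output, new_output) for each clashing pair.

    `existing` and `new` are iterables of (instruction, output) tuples.
    """
    seen = {}
    for instruction, output in existing:
        # Keep the first output recorded for each normalized instruction.
        seen.setdefault(normalize(instruction), output)

    conflicts = []
    for instruction, output in new:
        key = normalize(instruction)
        if key in seen and seen[key].strip() != output.strip():
            conflicts.append((instruction, seen[key], output))
    return conflicts
```

An empty result from `find_conflicts(existing_pairs, new_pairs)` would correspond to the zero-conflict criterion under this assumed definition; exact duplicates are not reported as conflicts.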
Complete the corpus with AGI-specific tasks and probabilistic models while refining overall quality.
Final 6,000-7,000 validated pairs; finalized validation system; complete extraction/generation model; comprehensive test suite
$10,000 USD
99% pass rate on automated validation; full coverage of specified AGI tasks; successful integration with Hyperon framework; all probabilistic models verified accurate
Package all tools, create comprehensive documentation, and establish future maintenance protocols.
Complete 20,000-pair corpus; all source code and tools; comprehensive documentation; tutorial videos and examples
$5,000 USD
Successful test runs by external developers; documentation covers all major use cases; tools successfully deployed in test environment; positive feedback from user testing
© 2025 Deep Funding