Evolving DNN Architectures
The goal of this project is to create a framework that uses evolutionary computation to design new neural network architectures for natural language prediction. The idea is to build on the success of transformers while exploring entirely new design possibilities that could outperform them. By simulating evolution—introducing variations, selecting the best-performing models, and iterating—the framework aims to uncover architectures that go beyond what human researchers have created. We aim to represent neural networks as directed acyclic graphs - and perform mutations and crossover on those representations. We also plan to allow mutations/crossover on model hyperparameters as well.
Join the Discussion (0)
Please create account or login to post comments.