- Groups of 3 to 4 students (A solo or duo project is possible, but not recommended, for 1470 with capstone or Ph.D. students. The student must check with the instructor and get approval.)
- Present and submit a poster/slides of the project at the end of the semester recapping your work using GitHub.
- We will host a "Deep Learning Day" at the end of the semester. Student groups will participate in-person for poster presentations.
- Participate in "Deep Learning Day" by engaging with other student's projects at the end of the semester.
- Meet all deadlines and check-ins specified below.
- Your code can be written using any deep learning library, such as TensorFlow, Keras, Jax, or PyTorch.
## Scope
The project is "open-ended," meaning it's open to interpretation. For 1470-noncapstone students, there are two options. For 1470-Capstone students, you must choose the **second** option of solving a new problem.
### Option 1: Re-implement a research paper
Find a paper from a recent machine learning conference that describes a deep-learning-based system, and try to reproduce its results. For this approach to be valid, the re-implementation must not be a trivial effort. If there's already open-source code that comes with the paper, you can still do it, but you'd need to at least all of the following:
1. Implement the system in a different framework (e.g. PyTorch instead of TensorFlow)
2. Try your implementation on a different dataset than the one(s) in the paper
3. Extend the implementation (new feature, interpretability, etc) in a *non-trivial manner*.
We'll also ask you to share links to any public implementations you come across for verifying your code is your own work. If you need inspiration for potential papers to try, look through the recent proceedings of the following conferences:
- AI / Machine Learning/ Data Mining:
- The International Conference on Learning Representations (ICLR)
- The International Conference on Machine Learning (ICML)
- Knowledge Discovery and Data Mining (KDD)
- Association for the Advancement of AI (AAAI)
- The Reinforcement Learning Conference (RLC)
- Computer Vision
- Computer Vision and Pattern Recognition (CVPR)
- The International/European Conference on Computer Vision (ICCV/ECCV)
- Natural Language Processing
- The Association for Computational Linguistics (ACL)
- Empirical Methods in Natural Language Processing (EMNLP)
- Computational Biology and Health
- Research in Computational Molecular Biology (RECOMB)
- Intelligent Systems for Molecular Biology /European Conference on Computational Biology (ISMB/ECCB)
- Pacific Symposium on Biocomputing (PSB)
Most paper authors will have made pre-prints publicly available on their personal websites or via [arXiv](https://arxiv.org/). Empirically, it is often a good idea to pick research papers whose source code has been released by the authors. This gives you a good idea how easy it is to reproduce the results with their own codes, and the amount of work required for reimplementation.
### Option 2: Try to solve a new problem
- You run a thorough analysis of model bias (e.g. gender, racial) for one or a collection of popular deep learning models that have been open-sourced.
Example uses that are **not** permitted:
- You take open-sourced model checkpoints and just "fine-tune" them on another dataset.
- You take an open-sourced framework and replace its ResNet-50 with a ResNet-101.
Please cite all the open-source frameworks you used in the final report, and check with your mentor TA if they are okay with your proposal.
You can check out the projects in the [previous Deep Learning Day](https://brown-deep-learning-day-f2021.devpost.com/project-gallery) or [Stanford CS231n](http://cs231n.stanford.edu/2017/reports.html) to draw some inspiration, but your project needs to differentiate from previous projects in meaningful ways.