Skip to content
Joshua Wu edited this page Apr 13, 2024 · 7 revisions

Background

The data.table package is an invaluable tool for data analysis and manipulation.

Related work /impact

data.table is one of the most widely used R packages.

Details of your coding project

Find one or more students to fix/close some of the hundreds of outstanding issues.

Maybe would be good to start with the issues labeled as beginner-task.

It would be easier for me if a student approached and said something like :

  • “I want to do some serious C at low level”. I could then propose something there.
  • “I want to write documentation/vignette/whitepaper”. Then I could suggest writing about data.table code that hasn’t been written about before.
  • “I want to close 100 issues, one per day”. Then maybe I could pick the ones where that might be possible.
  • “I want to work on performance testing / benchmarking”
  • “I want to work on performance testing/ using atime for benchmarking”. I will teach on how to use atime to check if reported data.table issues have been fixed.

Mentors/tests

Test is to find an issue labeled as beginner-task,

  • try to reproduce on your own computer, and report your results on the issue (were you able to reproduce? or not?)
  • create a PR which would close that issue.

When you have finished at least one test, please add a link to it on this page, then contact the following mentors:

  • Toby Dylan Hocking @tdhock
  • Anirban Chetia @Anirban166
  • Doris Amoakohene @DorisAmoakohene

Potential contributor test results (to edit)

  • Contributor Name, link to issue, link to PR
  1. Atrayee Samanta, link to the issue, link to PR
  2. Nitish Jha link to the issue, link to PR
  3. Josh Wu link to the issue, link to the PR