fix : Added Examples of Named List Elements in i to Improve Clarity #6868

venom1204 · 2025-03-15T08:21:11Z

Closes #1945.
This PR updates data.table joins to use named lists (`on = .(...)`) instead of unnamed character vectors (`on = c("col1", "col2")`).

Changes in `vignettes/datatable-secondary-indices-and-auto-indexing.Rmd`:

* Added a subsection titled "Using named list elements in `i`" under "Fast subsetting using `on` argument and secondary indices".
* Demonstrated the difference between unnamed and named list elements, with examples for single and multiple keys.

@jangorecki @MichaelChirico, could you please review this when you have time?
Thanks.

codecov · 2025-03-15T08:27:47Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.59%. Comparing base (538c491) to head (f626801).
Report is 9 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #6868   +/-   ##
=======================================
  Coverage   98.59%   98.59%           
=======================================
  Files          79       79           
  Lines       14661    14661           
=======================================
  Hits        14455    14455           
  Misses        206      206

MichaelChirico · 2025-03-21T05:27:05Z

vignettes/datatable-secondary-indices-and-auto-indexing.Rmd

@@ -191,33 +191,64 @@ flights[.("JFK", "LAX"), on = c("origin", "dest")][1:5]

 * Since the time to compute the secondary index is quite small, we don't have to use `setindex()`, unless, once again, the task involves repeated subsetting on the same column.

-### b) Select in `j`


I think we can shrink the diff in this PR to basically one line. Instead of a new 'c)' entry here, I would add a bullet point here like

* For clarity/readability, it might help to name the inputs to `i`, e.g. `flights[.(origin = "JFK", dest = "LAX"), on=c("origin", "dest")]`

I'm not convinced any of the other changes add enough value to warrant inclusion.
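To make the suggested bullet concrete, here is a minimal sketch of the unnamed and named forms side by side. It assumes a small made-up table standing in for the vignette's `flights` data; the rows and the `carrier` values are invented for illustration and are not taken from the real dataset.

```r
library(data.table)

# Hypothetical stand-in for the vignette's `flights` table (values are made up).
flights <- data.table(
  origin  = c("JFK", "JFK", "LGA", "EWR", "JFK"),
  dest    = c("LAX", "BOS", "LAX", "LAX", "LAX"),
  carrier = c("AA", "B6", "DL", "UA", "VX")
)

# Unnamed inputs to `i`: the reader has to line up "JFK" and "LAX"
# with the columns listed in `on` to see what is being matched.
unnamed <- flights[.("JFK", "LAX"), on = c("origin", "dest")]

# Named inputs to `i`: each value states which column it is compared against.
named <- flights[.(origin = "JFK", dest = "LAX"), on = c("origin", "dest")]

# Both calls select the same rows; the names only help readability here.
stopifnot(isTRUE(all.equal(unnamed, named)))
print(named)
```

In this sketch the two calls return the same subset, so the named form costs nothing beyond a few extra characters.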

@MichaelChirico , I've updated the section based on your suggestion. Do you think it needs more detail, or does it look good as is? Whenever you have time, I'd appreciate your review. Thanks!

vignettes/datatable-secondary-indices-and-auto-indexing.Rmd

MichaelChirico

Thanks!

added examples

4a416fc

venom1204 added 2 commits March 15, 2025 16:28

updated version

c98ba08

Merge branch 'master' into issue1945

f4121a0

venom1204 marked this pull request as ready for review March 15, 2025 11:26

venom1204 requested a review from MichaelChirico as a code owner March 15, 2025 11:26

Merge branch 'master' into issue1945

8230038

MichaelChirico reviewed Mar 21, 2025

venom1204 added 2 commits March 21, 2025 12:11

Merge branch 'master' into issue1945

b76dc29

seggested improvements

a8a03e9

MichaelChirico reviewed Mar 21, 2025

vignettes/datatable-secondary-indices-and-auto-indexing.Rmd (outdated)

MichaelChirico reviewed Mar 21, 2025

vignettes/datatable-secondary-indices-and-auto-indexing.Rmd

tweak

046c9da

MichaelChirico reviewed Mar 21, 2025

vignettes/datatable-secondary-indices-and-auto-indexing.Rmd (outdated)

further tweak

f626801

MichaelChirico approved these changes Mar 21, 2025

MichaelChirico merged commit 6aa99f5 into Rdatatable:master Mar 21, 2025
8 checks passed
