INTPYTHON-750: Support converting $expr with $getField paths#392
INTPYTHON-750: Support converting $expr with $getField paths#392
Conversation
Co-authored-by: Noah Stapp <noah.stapp@mongodb.com>
b99aba3 to
94eefad
Compare
| expr = { | ||
| "$expr": { | ||
| "$gt": [ | ||
| {"$getField": {"input": "$price", "field": "value"}}, |
There was a problem hiding this comment.
Can we optimize this down to remove the $getField when the field names don't contain $ or .? Something like:
{
"$match": {
"$expr": {
"$gt": ["$price.value", "$discounted_price.value"]
}
}
}
There was a problem hiding this comment.
We could. I'm trying to think if there's any case where this would be an issue if it mistakenly resolved it.
There was a problem hiding this comment.
This would have to be a two-phase conversion. I think it may even be best to still change it in our actual query code.
There was a problem hiding this comment.
I'm assuming here that this change ends up being needed. Holding off until we either get customer complaints or the refactor can't solve this makes more sense to me for now.
There was a problem hiding this comment.
Yeah, I tried to make it work (and I think I managed to), but the silent mutation of $getField to $field is worrying to me. Even if I manage to make things pass tests, it just feels out of scope of the optimizers goal, which is to explicitly change things from $expr to $match.
There was a problem hiding this comment.
Pull Request Overview
This PR implements support for $getField expressions in the expression converter system, allowing MongoDB aggregation expressions using $getField to be optimized into simpler query conditions.
- Adds helper methods to detect and convert
$getFieldexpressions to dot notation paths - Updates binary and
$inoperators to handle$getFieldexpressions alongside simple field references - Adds comprehensive test coverage for
$getFieldconversion scenarios
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| django_mongodb_backend/query_conversion/expression_converters.py | Core implementation of $getField support with path conversion methods and updated operator handling |
| tests/expression_converter_/test_op_expressions.py | Test cases for $getField conversion across all binary operators and $in operator |
| tests/expression_converter_/test_match_conversion.py | Integration tests for $getField optimization in match expressions |
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
| # Check if first argument is a simple field reference | ||
| # Check if second argument is a list of simple values | ||
| if (field_name := cls.convert_path_name(field_expr)) and ( | ||
| isinstance(values, list | tuple | set) |
There was a problem hiding this comment.
Use explicit tuple syntax (list, tuple, set) instead of union operator | for better compatibility with older Python versions.
| isinstance(values, list | tuple | set) | |
| isinstance(values, (list, tuple, set)) |
|
closing in lieu of #396 |
Summary
An addendum to INTPYTHON-739 This PR implements support for
$getFieldexpressions in the expression converter system, allowing MongoDB aggregation expressions using$getFieldto be optimized into simpler query conditions. This is especially useful when optimizing queries against embedded model fields.The format of
$getFieldcalls need to meet these conditions:{ $getField: { input: "$path" | { $getField : {...} }, # A string starting with "$" field: "path" # A string with no "$" or "." } }This supports nested
$getFieldoperations.Changes in this PR
Test Plan
python manage.py shellSee an example query below.Screenshots (grabbed text)
Callouts
$getFieldcall unless they also feed into a$matchconversion because silently mutating$getFieldshould best be dealt in the larger refactor.NoneTypeis used a lookup, the optimization needs to wrap two statements in an$andquery.