Fix nls crash, and better refresh diagnostics. #1944

jneem · 2024-06-05T20:36:35Z

In nls, if main.ncl imports dep.ncl and dep.ncl changes, we need
to invalidate some cached data about main.ncl and re-check it.
Previously we were simply resetting main.ncl to the "parsed" state in
the cache, but this isn't sufficient because our cached term for
main.ncl might depend on dep.ncl. Specifically, import resolution of
main.ncl transforms the term in a way that depends on whether
dep.ncl parsed.

This PR does the most inefficient (but easiest to get correct)
thing: throwing out all the parsed data and re-parsing from scratch.
This can be optimized, but it probably isn't too much slower than the
status quo, because we were re-typechecking from scratch anyway.

Fixes #1943 and maybe also #1942 and #1935. There are still some bugs related to cache state in the background worker, which I will investigate next.

In nls, if `main.ncl` imports `dep.ncl` and `dep.ncl` changes, we need to invalidate some cached data about `main.ncl` and re-check it. Previously we were simply resetting `main.ncl` to the "parsed" state in the cache, but this isn't sufficient because our cached term for `main.ncl` might depend on `dep.ncl`. Specifically, import resolution of `main.ncl` transforms the term in a way that depends on whether `dep.ncl` parsed. This commit does the most inefficient (but easiest to get correct) thing: throwing out all the parsed data and re-parsing from scratch. This can be optimized, but it probably isn't too much slower than the status quo, because we were re-typechecking from scratch anyway.

When `main.ncl` imports `dep.ncl` and `dep.ncl` changes, we invalidate `main.ncl` and re-do some checks. This commit adds background invalidation to the list of checks that we re-do.

yannham · 2024-06-06T09:12:20Z

core/src/cache.rs

+    /// Remove the cached term associated with this id.
+    ///
+    /// The file contents associated with this id remain, and they will be re-parsed if necessary.
+    pub fn reset(&mut self, file_id: FileId) {


I have mixed feeling about this function. I wonder if it can lead to inconsistent state, like the entry's state saying that the file is parsed while it's not in the terms cache. Or some ResolvedImport somewhere depending on this file_id which suddenly becomes invalid.

I guess a first step would be to change the state of the entry as well. I would prefer this function to be private, but nls needs it obviously. Maybe both a comment warning about the fact that invalidating a file_id that may be in use somewhere is "dangereous" (can cause panics in practice), and/or giving the function an explicit name, like unsafe_reset or _reset or whatever?

Another possibility might be to handle the re-parsing in the same function, so that you can't observe externally the missing file, but I don't know if that would fit your workflow here.

I guess I should be clearing some of the other state (i.e. imports, rev_imports, and wildcards), but the entry state is contained in terms so there's no chance in it being stale, right?

After discussion at the weekly, I'll refactor this so that Cache's API does the rev-dependency invalidation. This way Cache's public API can't leave things in an invalid state.

yannham · 2024-06-06T09:15:22Z

lsp/nls/src/world.rs

+            // Note that this will cause the contents to be re-parsed, which might be avoidable
+            // in some situations. It's safest to completely reset the state because post-parsing
+            // transformations (especially import resolution) can change the cached term in ways
+            // that are invalidated by the changes we just received.


So, morally, would re-running the pipeline from import resolution (and then typechecking and analysis) be sufficient? Of course not in the current state, it would need some adaptation to program transformation. I'm just trying to understand at a high-level how we could make it better in the long term.

I think re-running from import resolution should work, but then we'd need to keep a copy of the pre-import-resolution term (right now we consume it to produce the import-resolved term).

The more performant option would be to keep track of what changed with the dependency: if the dependency's success/failure status didn't change, we don't need to re-run import resolution. If the dependency's type didn't change, we don't need to re-run typechecking.

jneem · 2024-06-07T17:43:21Z

@yannham I've implemented the solution we discussed yesterday, where the full recursive invalidation happens in the cache.

It occurred to me that there's still some possibility for misuse in the cache API, since replace_string deletes the cached term without invalidating things that import it. Maybe replace_string should do the invalidation automatically? I didn't do that here because it's more invasive...

yannham · 2024-06-10T08:25:36Z

It occurred to me that there's still some possibility for misuse in the cache API, since replace_string deletes the cached term without invalidating things that import it. Maybe replace_string should do the invalidation automatically? I didn't do that here because it's more invasive...

Hmm, I'm not entirely set on this. On one hand you're right that this has the same risk of kinda inconsistent state lying around. On the other hand replace_string is used in practice for like queries and error message snippets, which shouldn't be imported from anywhere. Well then maybe it's virtually free to invalidate the reverse dependencies, because in practice there shouldn't be any?

yannham · 2024-06-10T08:28:02Z

lsp/nls/src/world.rs

-        // Invalidate any cached inputs that imported the newly-opened file, so that any
-        // cross-file references are updated.


Shouldn't this comment stay?

jneem · 2024-06-10T14:12:20Z

Well then maybe it's virtually free to invalidate the reverse dependencies, because in practice there shouldn't be any?

That's probably right, I just didn't want this bugfix to have unintended consequences...

jneem added 3 commits June 5, 2024 15:27

Add a test

a45cc95

Re-do background evaluation for invalidated files.

98877c3

When `main.ncl` imports `dep.ncl` and `dep.ncl` changes, we invalidate `main.ncl` and re-do some checks. This commit adds background invalidation to the list of checks that we re-do.

github-actions bot temporarily deployed to pull request June 5, 2024 20:39 Inactive

jneem mentioned this pull request Jun 5, 2024

Non-deterministic typecheck LSP errors when modifying a transitive import of a file #1942

Open

yannham reviewed Jun 6, 2024

View reviewed changes

Move the recursive invalidation into Cache

9735d39

github-actions bot temporarily deployed to pull request June 7, 2024 17:09 Inactive

jneem requested a review from yannham June 7, 2024 17:43

yannham approved these changes Jun 10, 2024

View reviewed changes

Reinstate/reword comment

f0b0151

jneem enabled auto-merge June 10, 2024 14:13

github-actions bot temporarily deployed to pull request June 10, 2024 14:13 Inactive

jneem added this pull request to the merge queue Jun 10, 2024

Merged via the queue into master with commit 446128b Jun 10, 2024
5 checks passed

jneem deleted the nls-crash branch June 10, 2024 14:36

jneem mentioned this pull request Jun 10, 2024

Update dependencies in the background evaluator #1948

Merged

BrewTestBot mentioned this pull request Jun 11, 2024

nickel 1.7.0 Homebrew/homebrew-core#174281

Merged

yannham mentioned this pull request Jun 19, 2024

Unresolved import causes nls to panic #1935

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix nls crash, and better refresh diagnostics. #1944

Fix nls crash, and better refresh diagnostics. #1944

jneem commented Jun 5, 2024

yannham Jun 6, 2024

jneem Jun 6, 2024

jneem Jun 6, 2024

yannham Jun 6, 2024

jneem Jun 6, 2024

jneem commented Jun 7, 2024

yannham commented Jun 10, 2024 •

edited

Loading

yannham Jun 10, 2024

jneem commented Jun 10, 2024

		// Invalidate any cached inputs that imported the newly-opened file, so that any
		// cross-file references are updated.

Fix nls crash, and better refresh diagnostics. #1944

Fix nls crash, and better refresh diagnostics. #1944

Conversation

jneem commented Jun 5, 2024

yannham Jun 6, 2024

Choose a reason for hiding this comment

jneem Jun 6, 2024

Choose a reason for hiding this comment

jneem Jun 6, 2024

Choose a reason for hiding this comment

yannham Jun 6, 2024

Choose a reason for hiding this comment

jneem Jun 6, 2024

Choose a reason for hiding this comment

jneem commented Jun 7, 2024

yannham commented Jun 10, 2024 • edited Loading

yannham Jun 10, 2024

Choose a reason for hiding this comment

jneem commented Jun 10, 2024

yannham commented Jun 10, 2024 •

edited

Loading