Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP Support asynchronous contents managers #1021

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

mwouts
Copy link
Owner

@mwouts mwouts commented Dec 10, 2022

May fix #1020

@mwouts mwouts force-pushed the support_async_contentsmanagers branch 2 times, most recently from ca191d7 to 45fde60 Compare December 10, 2022 18:39
@mwouts mwouts force-pushed the support_async_contentsmanagers branch from 45fde60 to c5bdeed Compare December 10, 2022 23:58
@mwouts
Copy link
Owner Author

mwouts commented Dec 11, 2022

@blink1073 thanks for your advices at jupyter-server/jupyter_server#1127

At this point (c5bdeed) I have been able to make the test pass on the contents manager (and, as you suggested, the tests are setup to test both the CM derived from the LargeFileManager and the CM derived from the AsyncLargeFileManager).

Now the problem I am facing is that I don't see how to contain the expansion of the async functions... Take for instance this function write_pair, which takes as a argument a function that writes one file. In the context manager the function write_one_file is async, so write_pair becomes async as well. Is there any way to avoid this? (*) Does it mean that every function in jupytext that ever read or writes a paired notebook will have to become async? And any function depending on them? 😄

(*): I tried this: loop = asyncio.get_event_loop(); oop.run_until_complete(...) but it complained about the loop being already running

image

@mwouts mwouts force-pushed the support_async_contentsmanagers branch from 593a4a9 to 47b95e2 Compare December 11, 2022 01:58
@mwouts
Copy link
Owner Author

mwouts commented Dec 11, 2022

In the end I've decided to write two functions: write_pair and write_pair_async in this commit - this seems to work well to limit the expansion of async functions.

@mwouts
Copy link
Owner Author

mwouts commented Dec 11, 2022

An experimental version with support for async contents manager is available with:

BUILD_JUPYTERLAB_EXTENSION=1 pip install git+https://github.com/mwouts/jupytext.git@support_async_contentsmanagers

We will need to test it in various environments before releasing that (and also I'd be curious to identify the limits, i.e. what happens if we test it with an older version of Jupyter, etc)

@mwouts
Copy link
Owner Author

mwouts commented Dec 13, 2022

I see at least two issues on the CI:

  1. Not all versions of jupyter_core have ensure_async
  2. I've had to add two new methods like rename or trust_notebook to the contents manager (as otherwise their base implementation was either not async, or not awaiting for the async calls), and at least one of them fail:
>       await cm.rename(tmp_nbpy, "new.nb.py")

tests/test_contentsmanager.py:278: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

self = <jupytext.contentsmanager.build_jupytext_contents_manager_class.<locals>.JupytextContentsManager object at 0x7fd97f57ba60>
old_path = 'notebook.nb.py', new_path = 'new.nb.py'

    async def rename(self, old_path, new_path):
        """NB: This method was added for the async port"""
        await ensure_async(self.rename_file(old_path, new_path))
        await ensure_async(
            self.checkpoints.rename_all_checkpoints(old_path, new_path)
        )
>       self.emit(
            data={"action": "rename", "path": new_path, "source_path": old_path}
        )
E       AttributeError: 'JupytextContentsManager' object has no attribute 'emit'

Advice is welcome!

@blink1073
Copy link

  • We use "jupyter_core>=4.12,!=5.0.*", to pick up ensure_async.
  • I don't follow, it looks like there are sync and async versions of rename and trust_notebook in server, either should work with and ensure_async
  • self.emit is only available in server 2.0

@mwouts
Copy link
Owner Author

mwouts commented Dec 14, 2022

Thanks @blink1073 ! Oh yes I see then requiring jupyter_core as you stated, and jupyter_server>=2.0 should fix the two issues above.

Re the necessity to explicitly add the async def rename method in Jupytext's cm, I was referring to the sync implementation

def rename(self, old_path, new_path):
        """Rename a file and any checkpoints associated with that file."""
        self.rename_file(old_path, new_path)
        self.checkpoints.rename_all_checkpoints(old_path, new_path)
        self.emit(data={"action": "rename", "path": new_path, "source_path": old_path})

which is not compatible with an async rename_file in my derived class, hence the requirement to add that method - that's not really an issue, my only concern is that by doing so I duplicate some code and take the risk that it does not evolve like the base one.

@blink1073
Copy link

You can use run_sync from jupyter_core to convert an async function to a sync one, effectively.

@blink1073
Copy link

Ideally though, the Jupytext class would be async as well.

@LoicGrobol
Copy link

@mwouts Is there anything that would help you with this?

@mwouts
Copy link
Owner Author

mwouts commented Nov 22, 2023

Thank you @LoicGrobol for offering your help! Well this was a while ago, so I will need to look into it to answer more precisely.

I think I wanted to identify what version of Jupyter would be required if we were to make Jupytext's content manager fully async.

Also I was seeing a huge impact on the tests, probably because I was/am not aware of the best way to test async method, so I could use a few pointers there.

Also we will have to redo that PR from scratch, but maybe only when the test reorganization is over (one of the currently opened PR).

@LoicGrobol
Copy link

Ok I'll have a look into these points then :)

@mwouts mwouts force-pushed the support_async_contentsmanagers branch from 2243ba6 to 6875c84 Compare November 23, 2023 22:33
@mwouts
Copy link
Owner Author

mwouts commented Nov 23, 2023

Hi @LoicGrobol , I have done a quick rebase of this branch, so that in the future we can experiment with it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Can't open the jupyter notebook after the jupytext installation
3 participants