Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make groupby.size more efficient #371

Closed
wants to merge 3 commits into from
Closed

Conversation

phofl
Copy link
Collaborator

@phofl phofl commented Oct 27, 2023

groupby().size() reduces to a series- The only time where the column that we are operating on matters is for groupby().x.size() where x influences the result name. So We can get rid of all columns to reduce memory consumption

cc @mrocklin query 1 of tpch

@phofl phofl closed this Dec 29, 2024
@phofl phofl deleted the size_optimization branch December 29, 2024 18:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant