Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor internal components #6360

Open
JorTurFer opened this issue Nov 25, 2024 · 1 comment
Open

Refactor internal components #6360

JorTurFer opened this issue Nov 25, 2024 · 1 comment
Labels
feature-request All issues for new features that have not been committed to needs-discussion stale-bot-ignore All issues that should not be automatically closed by our stale bot

Comments

@JorTurFer
Copy link
Member

JorTurFer commented Nov 25, 2024

Proposal

I've been thinking about our code and I think that it's a bit coupled, making difficult to manage the life cycle of the items internally sometimes. (During the last months we have had several issues related with the scaler lifecycle, not properly closed connections, etc) and I don't like so much some parts of the core code. IMHO there are mixed responsibilities about what each thing does, and we have some duplications (almost the same code) because the querying way when internal scale loop is slightly different from the HPA querying way. Although the goal and the main idea is quite similar, the code is duplicated. This idea has been in my head for some time and recent issues related with the scaler lifecycle and unclosed connections have pushed it out.

My proposal is to split the responsibilities in different layers (I'm not speaking about tons of interfaces, DTOs and clean code patterns at all, just from resposibities pov), where we manage each layer isolated from others, not sharing the internal information to avoid coupling them.
Disclaimer: Please, ignore the term "service", I used it to make a difference between our controllers (strongly related with k8s controllers) and the controllers/layers/services.

The main idea is that instead of querying metrics in different ways, we have a single layer (scaler service) where all the scalers of an scalableobject (SO/SJ) live, and other layers on top that consume this scaler layer but totally as blackbox. Let's explain the SO flow:

  1. A ScaledObject is created
  2. ScaledObject controller creates the HPA and calls the scaledobject service to register the ScaledObject
  3. The ScaledObject service pass the ScaledObject to the scalableobject service (a common service for SO and SJ) and starts a loop that queries every polling interval the status of the ScaledObject (Active/Inactive).
    1. If the scaler changes the status, it scales from/to 0
  4. The scalableobject service parses the scaledobject received and call to the scaler service to create the required scalers.
    1. If the scaler configurations/types/etc changes, this service call to the scaler service to update/remove the scalers
    2. This service can be queried about the scalableobject state or just about a single metric
    3. This service calls to fallback layer to calculate the fallback if the SO defines it and there is a fail on a scaler
    4. This service calls to the formula calculator if required
  5. The scaler service is the responsible for creating/refreshing/deleting the scalers when it's required, but also is the responsible for querying the specific scaler.
    1. Scalers are treated as single items, not knowing about other scalers in the SO/SJ
    2. This service stores (or not) the metric based on an arg, e.g: we could use an enum to defice if we have to READ,READ&STORE or READFROMCACHE
    3. The service handles scaler recreation on demand when it's required (e,g: failures)
    4. The service handles the scaler creation, including the arrange of envs from auth sources

image

@JorTurFer JorTurFer added needs-discussion feature-request All issues for new features that have not been committed to labels Nov 25, 2024
@JorTurFer JorTurFer added the stale-bot-ignore All issues that should not be automatically closed by our stale bot label Nov 27, 2024
@SpiritZhou
Copy link
Contributor

This is a good improvement that makes the structure and responsibilities cleaner. I just have a few detailed questions:

  1. Do the ScaledJob controller and ScaledObject controller each have a looping structure? Will every ScaledJob/ScaledObject start a loop?
  2. Is it correct that the cached metric value of the scaler will only be refreshed when a ScaledJob/ScaledObject calls after the poll interval or when the metric server is queried?
  3. The formula calculator and fallback layer are called by the scalableobject service, but each ScaledJob/ScaledObject's definition is different. Will the scalableobject service query back to the ScaledJob/ScaledObject controller to retrieve the definition, or will the ScaledJob/ScaledObject controller pass each ScaledJob/ScaledObject's formula and fallback definition to the formula calculator and fallback layer during creation?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature-request All issues for new features that have not been committed to needs-discussion stale-bot-ignore All issues that should not be automatically closed by our stale bot
Projects
Status: To Triage
Development

No branches or pull requests

2 participants