
How to generate 3D assets with a higher number of faces? #58

Open
supersyz opened this issue Dec 12, 2024 · 20 comments

@supersyz

Hi, I really appreciate your great open-source work!
I notice the output objects have a small number of faces. How can I generate 3D assets with more faces?

@kitcheng

In the example.py and app.py files, search for the term "simplify" and set it to a fixed value of 0.

This way, it will not reduce the number of faces.

[Screenshot of the code to change.]
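For reference, the call being changed looks roughly like this (a sketch based on the repo README's usage of postprocessing_utils.to_glb; names and defaults may differ slightly in your checkout):

```python
from trellis.utils import postprocessing_utils

# 'outputs' is the dict returned by pipeline.run(...).
# simplify is the ratio of triangles removed during decimation;
# 0.0 skips simplification and keeps every face of the extracted mesh.
glb = postprocessing_utils.to_glb(
    outputs['gaussian'][0],  # Gaussian (appearance) output
    outputs['mesh'][0],      # extracted mesh
    simplify=0.0,
    texture_size=1024,
)
glb.export("sample.glb")
```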

@JackDainzh

Better to just edit the Simplify Gradio slider in app.py so it can reach the desired minimum.
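In app.py that amounts to lowering the slider's minimum, something like this (a sketch; the exact variable name, label, and defaults in app.py may differ):

```python
import gradio as gr

# Let the UI turn simplification off entirely by extending the
# slider's minimum down to 0.0.
mesh_simplify = gr.Slider(0.0, 0.98, label="Simplify", value=0.95, step=0.01)
```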

@visualbruno

I changed app.py: added a display of the number of vertices, increased the sampling steps to 100, changed the mesh-simplification slider range to 0 to 0.98, and increased the max texture size to 4096.

[Screenshot of the modified UI.]

Download my app.py file here
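Reconstructed from the description above, the widened control definitions would look something like this (illustrative values, not the actual file):

```python
import gradio as gr

# Hypothetical ranges matching the changes described above.
ss_sampling_steps = gr.Slider(1, 100, label="Sampling Steps", value=100, step=1)
mesh_simplify = gr.Slider(0.0, 0.98, label="Simplify", value=0.95, step=0.01)
texture_size = gr.Slider(512, 4096, label="Texture Size", value=1024, step=512)
```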

@realisticdreamer114514

realisticdreamer114514 commented Dec 14, 2024

@visualbruno what guidance strengths are the best for details/fidelity in the 2 stages? Do we really need 100 steps for this level of detail?

@visualbruno

> @visualbruno what guidance strengths are the best for details/fidelity in the 2 stages? Do we really need 100 steps for this level of detail?

Hi. I'm not sure whether 100 steps is better than 50; sometimes 20 is not good enough.
With the guidance strength at 10 in both stages, it follows the image more closely.

Check the screenshots:
1st screenshot: guidance of 0
2nd screenshot: guidance of 10

In the 2nd screenshot, the fidelity is very good.

[Screenshots: guidance strength of 0 vs. guidance strength of 10.]

@cjjkoko

cjjkoko commented Dec 15, 2024

> Hi. I'm not sure whether 100 steps is better than 50; sometimes 20 is not good enough. With the guidance strength at 10 in both stages, it follows the image more closely. […]

My params:

simplify: 0.7
texture_size: 2048
seed=20,
sparse_structure_sampler_params={
    "steps": 100,
    "cfg_strength": 7.5,
},
slat_sampler_params={
    "steps": 100,
    "cfg_strength": 3.5,
},

It works fine, but I'm still working on better parameters, including modifying some that aren't exposed.
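Assembled into a complete call, those parameters would look like this (a sketch following the pipeline usage in the repo README; file paths are placeholders):

```python
from PIL import Image
from trellis.pipelines import TrellisImageTo3DPipeline
from trellis.utils import postprocessing_utils

pipeline = TrellisImageTo3DPipeline.from_pretrained("JeffreyXiang/TRELLIS-image-large")
pipeline.cuda()

image = Image.open("input.png")
outputs = pipeline.run(
    image,
    seed=20,
    # Stage 1: sparse structure sampler
    sparse_structure_sampler_params={"steps": 100, "cfg_strength": 7.5},
    # Stage 2: structured latent (SLAT) sampler
    slat_sampler_params={"steps": 100, "cfg_strength": 3.5},
)

glb = postprocessing_utils.to_glb(
    outputs['gaussian'][0], outputs['mesh'][0],
    simplify=0.7, texture_size=2048,
)
glb.export("sample.glb")
```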

@visualbruno

I think the biggest issue is the input picture resolution, which is resized to 518x518 in trellis_image_to_3d.py.
I tested with a higher resolution like 2058x2058, but the result was horrible. They probably trained the model at this low resolution.
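For anyone looking for the spot, the preprocessing in trellis_image_to_3d.py scales the (cropped) input to a fixed size before it reaches the image encoder, roughly like this (a sketch; the surrounding code differs):

```python
from PIL import Image

image = Image.open("input.png").convert("RGB")
# This hard-coded 518x518 is the value to change when experimenting
# with other input resolutions.
image = image.resize((518, 518), Image.Resampling.LANCZOS)
```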

@cjjkoko

cjjkoko commented Dec 16, 2024

> I think the biggest issue is the input picture resolution, which is resized to 518x518 in trellis_image_to_3d.py. […]

[Two screenshots.] Maybe postprocessing will give a great result.

@realisticdreamer114514

@cjjkoko What kind of postprocessing do you use?

@cjjkoko

cjjkoko commented Dec 16, 2024

> @cjjkoko What kind of postprocessing do you use?

trimesh and open3d, using Laplacian smoothing.
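A minimal sketch of that kind of Laplacian post-smoothing, assuming the mesh was already exported to disk (file names and iteration counts are illustrative):

```python
import trimesh
import open3d as o3d

# Option 1: trimesh's in-place Laplacian filter.
mesh = trimesh.load("output.glb", force="mesh")
trimesh.smoothing.filter_laplacian(mesh, iterations=10)
mesh.export("output_smoothed.glb")

# Option 2: open3d's Laplacian smoothing.
o3d_mesh = o3d.io.read_triangle_mesh("output.obj")
o3d_mesh = o3d_mesh.filter_smooth_laplacian(number_of_iterations=10)
o3d.io.write_triangle_mesh("output_smoothed.obj", o3d_mesh)
```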

@realisticdreamer114514

realisticdreamer114514 commented Dec 16, 2024

> trimesh and open3d

These only smooth the output meshes; they don't improve quality during generation.

You can see that even with a high-resolution input image and a high texture size, e.g. 4096, much of the detail in the final meshes' texture map is lost when it should be preserved (I tested with some character cosplay photos and found this). It might be better if someone with the GPU power trained/finetuned the I23D model to work at an input resolution of, say, 770^2 or 1036^2, since, as visualbruno points out, the pipeline is set to the resolution the official model was trained on (518^2), and this downsizing might explain the detail loss.

@cjjkoko

cjjkoko commented Dec 16, 2024

> These only smooth the output meshes; they don't improve quality during generation. […]

Yes, but there is no specific date for the training.

@realisticdreamer114514

> there is no specific date for the training

Even at the current default resolution, the official I23D checkpoint seems quite undertrained (not sure if that's the right way to put it), so it doesn't adhere to the input image closely enough and tends to distort details that are still clear after downscaling. Finetuning on this framework can't come soon enough...

@cjjkoko

cjjkoko commented Dec 18, 2024

> Even at the current default resolution, the official I23D checkpoint seems quite undertrained, so it doesn't adhere to the input image closely enough and tends to distort details. […]

Hmm, expect big breaking updates in the next release. At this point, you can only adjust the seed to fit each image; I'm currently doing that, and it's very painful.

@visualbruno

I played with many parameters: the input image resizing, the number of sampling steps, the texture resolution, and the "number of views" used in postprocessing. So I modified all the main scripts and app.py to experiment with them.

The best result I got is with:

  • Input image resized to 770, instead of 518.
  • Number of sampling steps: 500
  • Texture resolution: 2048
  • Postprocessing "number of views": 120, instead of 100 (it slightly reduces artifacts on the texture)

Of course, with these values it takes much more time to generate the model.
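Roughly how this recipe maps onto the code (a sketch following the README's pipeline usage; the 518-to-770 resize and the postprocessing view count are hard-coded in trellis_image_to_3d.py and postprocessing_utils.py, so those two changes are edits to the source files, not arguments; the cfg values below are the repo defaults, which this comment doesn't specify):

```python
from PIL import Image
from trellis.pipelines import TrellisImageTo3DPipeline
from trellis.utils import postprocessing_utils

pipeline = TrellisImageTo3DPipeline.from_pretrained("JeffreyXiang/TRELLIS-image-large")
pipeline.cuda()

# Assumes trellis_image_to_3d.py was edited to resize to 770x770.
image = Image.open("input.png")
outputs = pipeline.run(
    image,
    sparse_structure_sampler_params={"steps": 500, "cfg_strength": 7.5},
    slat_sampler_params={"steps": 500, "cfg_strength": 3.0},
)

# Assumes postprocessing_utils.py was edited to bake the texture from
# 120 rendered views instead of 100.
glb = postprocessing_utils.to_glb(
    outputs['gaussian'][0], outputs['mesh'][0],
    simplify=0.0, texture_size=2048,
)
glb.export("sample.glb")
```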

I tried increasing the input image resolution to 1036 and above, but the results were worse.
I tried 800 and 1000 sampling steps, but that did not improve the result either.
A texture resolution of 4096 is not better than 2048.
I tried 200 for the postprocessing "number of views"; it did not improve the texture much, and the rendering time was multiplied by 10.

I tested with 2D anime pictures and they never render very well, probably because this kind of picture is flat and lacks relief and depth.

With Merlin from Seven Deadly Sins, input picture:
[image]
Result:
[image]

With Cleopatra, input picture:
[image]
Result:
[image]

With Knight, input picture:
[image]
Result:
[image]

@QuantumLight0

> I played with many parameters: the input image resizing, the number of sampling steps, the texture resolution, and the "number of views" used in postprocessing. […]

I believe the model is undertrained for anime models. Anime models have flat normals, so no depth; I believe a model needs to be trained on anime characters' faces in order to understand them. I do believe, however, that the multidiffusion mode has potential in this area, and I will provide a sample of why.
https://github.com/user-attachments/assets/bd2bf57b-0c55-42ad-b28c-2f8b2e5d84f9
[three images]

@visualbruno

@QuantumLight0 What parameters did you use to generate this model?

@QuantumLight0

> @QuantumLight0 What parameters did you use to generate this model?

I set everything to max.

[Screenshot of the settings.]

@visualbruno

@QuantumLight0 I hadn't seen that they updated the repository with the multi-image algorithm. I will play with it.
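For reference, the multi-image entry point looks roughly like this (a sketch based on the repo's multi-image example; check example_multi_image.py for the authoritative version, including whether the 'multidiffusion' mode name matches your checkout):

```python
from PIL import Image
from trellis.pipelines import TrellisImageTo3DPipeline

pipeline = TrellisImageTo3DPipeline.from_pretrained("JeffreyXiang/TRELLIS-image-large")
pipeline.cuda()

# Several views of the same object condition a single generation.
images = [Image.open(p) for p in ["view1.png", "view2.png", "view3.png"]]
outputs = pipeline.run_multi_image(
    images,
    seed=1,
    sparse_structure_sampler_params={"steps": 12, "cfg_strength": 7.5},
    slat_sampler_params={"steps": 12, "cfg_strength": 3.0},
    mode="multidiffusion",  # blends per-view predictions during denoising
)
```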

@cjjkoko

cjjkoko commented Dec 25, 2024

Any new breakthroughs?

@YuDeng added the good first issue (Good for newcomers) label Dec 25, 2024