Fun
November 6, 2024

Guide to Controllable AI Image Generation: Structure Control

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5
Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.


        const greeting = "Hello, world!";
        function sayHello() {
            console.log(greeting);
        }
        
Block quote Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

sdsds

Ordered list

  1. Item 1
  2. Item 2
  3. Item 3

Unordered list

  • Item A
  • Item B
  • Item C

Text link

Bold text

sdsdsd

Emphasis

Superscript

Subscript

Structure Control

In the rapidly evolving world of AI image generation, multimodal controls stand out as a cornerstone for creators looking to harness technology to enhance their artistic vision. These controls integrate different modes of input—text, images, and direct manipulation—to refine the output of AI generators. Among these, structure control is a pivotal feature, particularly in the domain of precise composition and detail replication.

Structure control is an advanced feature designed to meticulously replicate the semantic details and overall composition of an input image. In the backend, we compute structure as depth using a neural network. This capability is not about copying an image but about understanding and preserving its underlying structure—the arrangement of elements, the interaction of shapes, and the spatial distribution within the image. Such a feature is invaluable for artists and designers who wish to maintain the integrity of the original composition while infusing new elements or stylistic changes.

structure control AI image generation

Getting Started with Structure Control

To begin using structure control, users should select an image that they would like to use and upload it through the NEX toolbar, then select the image and set it as ‘Structure Image’.

  1. Open a new Artboard
structure image upload for AI image generation
  1. Upload your desired photo in the Assets tab
AI image generation Assets tab
uploading an input image for AI image generation Assets tab
  1. Set your image as 'Structure Image' and see it appear in the action bar below
uploading an input image for AI image generation Assets tab
  1. Adjust the strength of your image, and enter a prompt
uploaded structure image for AI image generation
  1. Generate!
generated AI images from structure image inputs

While structure control is powerful, it requires careful consideration of the input image. Here are a few tips:

  • Choose Clear, Well-Defined Images: The clearer the structural elements in the input image, the better the AI can replicate them.
  • Mind the Complexity: Overly complex images might result in muddy or confused outputs. Simplicity often leads to more coherent results.

Use Case Example

After you've explored the functionalities of structure control, consider a designer who uses an iconic cityscape as the structure for a futuristic redesign. By applying structure control, they can maintain the original cityscape’s layout while transforming its buildings into futuristic structures, effectively merging past and future without losing the essence of the composition.

AI generated image using a structure image input

Looking Ahead

Structure control is just the tip of the iceberg. As we continue to refine these tools, the potential for AI in creative fields becomes even more boundless. Stay tuned for more insights and tutorials on using other multimodal controls to fully unleash your creativity with AI image generation.

By harnessing tools like structure control, artists and creators are equipped to push the boundaries of traditional and digital art forms, opening up a new realm of possibilities for innovation and expression in the AI age.

If you have any questions for our team, we would be happy to answer them at outreach@nex.art!