ORLA

AI-Powered 3D Model Generator

Features • Screenshots • Technology • Get Started

ORLA is a web application that turns text prompts into 3D models. It generates a 2D render with Stable Diffusion 3.5, converts that render into a 3D model with Hunyuan3D-2, and displays the result in an interactive browser viewer.

Screenshots

ORLA Homepage with main interface

Features

Text-to-3D Generation: Create 3D models from textual descriptions
Image-to-3D Conversion: Transform 2D images into detailed 3D models
Interactive 3D Viewer: Examine and interact with generated models in real time
Multiple Output Images: View various angles and renderings of your 3D creation
User-Friendly Interface: Modern, responsive design built with React and TypeScript

Technology Stack

Frontend

React 19 with TypeScript
Three.js with React Three Fiber for 3D rendering
Tailwind CSS for styling
Vite as the build tool and development server

Backend

FastAPI for RESTful API endpoints
Celery for asynchronous task processing
Redis for task queue management
Stable Diffusion 3.5 AI model for image generation
Hunyuan3D-2 AI model for 3D model creation
Docker for containerization

Getting Started

Prerequisites

Node.js 18+ and npm
Python 3.9+
Docker and Docker Compose
Hugging Face API token (for accessing the Hunyuan3D-2 model)

Installation

Clone the repository

```
git clone https://github.com/AgarwalYash14/ORLA.git
cd ORLA
```

Set up the frontend
```
cd client
npm install
```
Set up the backend
```
cd ../server
```
Set environment variables. Create a .env file in the server directory:
```
HUGGINGFACE_API_TOKEN=your_token_here
```
Start the application with Docker
```
docker-compose up --build
```
Access the application. Open your browser and navigate to http://localhost:5173
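
The .env step above only defines the variable; this README does not show how the server reads it. As a minimal sketch, assuming the backend picks up `HUGGINGFACE_API_TOKEN` from the process environment, a startup check could fail fast when the token is missing or still set to the placeholder. The helper name `load_hf_token` is hypothetical and not part of ORLA's codebase.

```python
import os


def load_hf_token(env=None):
    """Return the Hugging Face token, or raise if it is unset or a placeholder.

    Hypothetical helper for illustration; ORLA's own loading code may differ.
    """
    env = os.environ if env is None else env
    token = env.get("HUGGINGFACE_API_TOKEN", "").strip()
    if not token or token == "your_token_here":
        raise RuntimeError(
            "HUGGINGFACE_API_TOKEN is not set; add it to server/.env"
        )
    return token
```

Calling `load_hf_token()` once at startup surfaces a missing token immediately instead of at the first model request.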

Usage

Enter a text prompt describing the 3D model you want to generate (e.g., "a futuristic sports car")
Click "Generate" and wait for the AI to process your request through Stability AI's image generation API
Review the generated image optimized for 3D conversion (with white background and clear object definition)
Select the image to initiate 3D model creation through Hunyuan3D-2
Interact with your 3D model using mouse controls (left-click and drag to rotate, scroll to zoom, right-click and drag to pan)

Image Generation Process

ORLA uses advanced AI models to generate 3D assets through a multi-step process:

Text Prompt Processing: Your text prompt is enhanced with specific parameters optimized for 3D modeling
AI Image Generation: The backend uses Stability AI's Stable Diffusion 3.5 Large Turbo model to generate high-quality 2D renders
Image Optimization: Generated images are processed with specific attributes (isolated objects, solid background, uniform lighting, etc.)
3D Model Creation: Selected 2D images are transformed into 3D models using Tencent's Hunyuan3D-2 neural network model
Real-time Visualization: The 3D model is rendered in the browser using Three.js for interactive viewing
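
The prompt processing step in the list above can be sketched as a small function that appends style hints to the user's text. This is only an illustration: the hint strings are taken from the attributes this README mentions, while the function name `enhance_prompt`, the constant `STYLE_HINTS`, and the comma-joined format are assumptions. The actual template ORLA uses is not shown here.

```python
# Hypothetical hint list, based on the attributes named in this README.
STYLE_HINTS = (
    "isolated object",
    "solid white background",
    "uniform lighting",
)


def enhance_prompt(prompt, hints=STYLE_HINTS):
    """Append 3D-friendly style hints to a prompt, skipping ones already present."""
    base = prompt.strip().rstrip(".,")
    if not base:
        raise ValueError("prompt must not be empty")
    lowered = base.lower()
    extra = [hint for hint in hints if hint.lower() not in lowered]
    return ", ".join([base, *extra])


print(enhance_prompt("a futuristic sports car"))
# prints: a futuristic sports car, isolated object, solid white background, uniform lighting
```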

The technical workflow includes:

Text prompts sent to FastAPI endpoints
Enhanced prompts include specifications for optimal 3D conversion (low-poly style, sharp edges, white background, etc.)
Celery workers process the requests asynchronously by calling the Stable Diffusion API
Generated images are saved and served from the server's static directory
Selected images are processed by Hunyuan3D-2 to create detailed 3D models
Redis manages the task queue for efficient processing of multiple requests
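
The submit-then-poll pattern behind this workflow can be illustrated with the standard library alone. In the sketch below a `ThreadPoolExecutor` stands in for the Celery worker pool and an in-memory dict stands in for Redis, so it is not ORLA's implementation. The function names (`submit`, `status`, `generate_image`) and the returned file path are hypothetical. The state names mirror Celery's built-in `PENDING`, `SUCCESS`, and `FAILURE` states.

```python
import uuid
from concurrent.futures import ThreadPoolExecutor

# Stand-ins for illustration only: ORLA uses Celery workers and a Redis queue.
executor = ThreadPoolExecutor(max_workers=2)
jobs = {}  # task id -> Future


def generate_image(task_id, prompt):
    """Placeholder for the image-generation call; returns a made-up file path."""
    return f"static/{task_id}.png"


def submit(prompt):
    """Queue a job and return its id immediately, as an API endpoint would."""
    task_id = uuid.uuid4().hex
    jobs[task_id] = executor.submit(generate_image, task_id, prompt)
    return task_id


def status(task_id):
    """Report the job state without blocking the caller."""
    future = jobs[task_id]
    if not future.done():
        return {"state": "PENDING"}
    if future.exception() is not None:
        return {"state": "FAILURE", "error": str(future.exception())}
    return {"state": "SUCCESS", "result": future.result()}
```

A client would call `submit` once, then poll `status` with the returned id until the state is no longer `PENDING`. This keeps the HTTP request short even when generation is slow.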

Project Structure

ORLA/
├── client/                # Frontend React application
│   ├── public/            # Static assets
│   └── src/
│       ├── assets/        # Images and other assets
│       ├── components/    # React components
│       ├── layout/        # Layout components
│       └── pages/         # Page components
└── server/                # Backend FastAPI application
    ├── app/
    │   ├── main.py        # FastAPI application
    │   ├── tasks.py       # Celery tasks
    │   └── model_generator.py # 3D model generation
    ├── Dockerfile         # Docker configuration
    └── docker-compose.yml # Docker Compose configuration

Development

Running the Frontend Locally

```
cd client
npm run dev
```

Running the Backend Locally

```
cd server
pip install -r requirements.txt
uvicorn app.main:app --reload
```

In a separate terminal:

```
# Start Redis (detached, so the terminal stays free for the worker)
docker run -d -p 6379:6379 redis
# Start Celery worker
cd server
celery -A app.tasks worker --loglevel=info
```
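
If the Celery worker fails to connect, it can help to confirm that Redis is actually listening before digging further. The following is a small stdlib-only sketch, not part of ORLA. The helper name `is_port_open` is hypothetical, and it only checks that a TCP connection succeeds, not that the service behind the port is Redis.

```python
import socket


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # 6379 is the port published by the docker run command above.
    print("Redis reachable:", is_port_open("localhost", 6379))
```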

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

Stable Diffusion 3.5 by Stability AI for image generation
Hunyuan3D-2 by Tencent for the 3D model generation
React Three Fiber for 3D rendering in React
FastAPI for the backend API
