ORLA is a web application that transforms text prompts into detailed 3D models using advanced AI technologies. With an intuitive interface and powerful backend, ORLA enables users to generate high-quality 3D assets quickly and easily.
- Text-to-3D Generation: Create 3D models from textual descriptions
- Image-to-3D Conversion: Transform 2D images into detailed 3D models
- Interactive 3D Viewer: Examine and interact with generated models in real-time
- Multiple Output Images: View various angles and renderings of your 3D creation
- User-Friendly Interface: Modern, responsive design built with React and TypeScript
- React 19 with TypeScript
- Three.js with React Three Fiber for 3D rendering
- Tailwind CSS for styling
- Vite as the build tool and development server
- FastAPI for RESTful API endpoints
- Celery for asynchronous task processing
- Redis for task queue management
- Stable Diffusion 3.5 AI model for image generation
- Hunyuan3D-2 AI model for 3D model creation
- Docker for containerization
- Node.js 18+ and npm
- Python 3.9+
- Docker and Docker Compose
- Hugging Face API token (for accessing the Hunyuan3D-2 model)
-
Clone the repository
git clone https://github.com/AgarwalYash14/ORLA.git cd ORLA -
Set up the frontend
cd client npm install -
Set up the backend
cd ../server -
Set environment variables Create a
.envfile in the server directory:HUGGINGFACE_API_TOKEN=your_token_here -
Start the application with Docker
docker-compose up --build
-
Access the application Open your browser and navigate to
http://localhost:5173
- Enter a text prompt describing the 3D model you want to generate (e.g., "a futuristic sports car")
- Click "Generate" and wait for the AI to process your request through Stability AI's image generation API
- Review the generated image optimized for 3D conversion (with white background and clear object definition)
- Select the image to initiate 3D model creation through Hunyuan3D-2
- Interact with your 3D model using mouse controls (orbit to rotate, scroll to zoom, right-click to pan)
ORLA uses advanced AI models to generate 3D assets through a multi-step process:
- Text Prompt Processing: Your text prompt is enhanced with specific parameters optimized for 3D modeling
- AI Image Generation: The backend uses Stability AI's Stable Diffusion 3.5 Large Turbo model to generate high-quality 2D renders
- Image Optimization: Generated images are processed with specific attributes (isolated objects, solid background, uniform lighting, etc.)
- 3D Model Creation: Selected 2D images are transformed into 3D models using Tencent's Hunyuan3D-2 neural network model
- Real-time Visualization: The 3D model is rendered in the browser using Three.js for interactive viewing
The technical workflow includes:
- Text prompts sent to FastAPI endpoints
- Enhanced prompts include specifications for optimal 3D conversion (low-poly style, sharp edges, white background, etc.)
- Celery workers process the requests asynchronously with Stable Diffusion API
- Generated images are saved and served from the server's static directory
- Selected images are processed by Hunyuan3D-2 to create detailed 3D models
- Redis manages the task queue for efficient processing of multiple requests
ORLA/
├── client/ # Frontend React application
│ ├── public/ # Static assets
│ └── src/
│ ├── assets/ # Images and other assets
│ ├── components/ # React components
│ ├── layout/ # Layout components
│ └── pages/ # Page components
└── server/ # Backend FastAPI application
├── app/
│ ├── main.py # FastAPI application
│ ├── tasks.py # Celery tasks
│ └── model_generator.py # 3D model generation
├── Dockerfile # Docker configuration
└── docker-compose.yml # Docker Compose configuration
cd client
npm run devcd server
pip install -r requirements.txt
uvicorn app.main:app --reloadIn a separate terminal:
# Start Redis
docker run -p 6379:6379 redis
# Start Celery worker
cd server
celery -A app.tasks worker --loglevel=infoContributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- Stable Diffusion 3.5 by Stability AI for image generation
- Hunyuan3D-2 by Tencent for the 3D model generation
- React Three Fiber for 3D rendering in React
- FastAPI for the backend API

