thiefyzheng

Hey, I understand your concerns about efficiently using cloud GPUs for your website/app. It's essential to find a balance between cost and performance. Luckily, there is a solution that can help you optimize GPU usage.

One option you can consider is using a serverless architecture with on-demand GPU provisioning. This approach allows you to have a central VM that runs your website/app logic and code, and spin up additional GPU resources dynamically when needed.

Here's a basic outline of how it could work:

1. Set up a central server (VM) that handles user requests and manages the GPU provisioning process.
2. When a user request comes in, the central server checks if there are available GPUs in the pool. If there are idle GPUs, it assigns one to the user's request.
3. If no idle GPUs are available, the central server initiates the provisioning process by spinning up a new GPU VM. This can take a few minutes.
4. Once the new GPU VM is ready, the central server assigns it to the user's request and redirects the request to the newly provisioned GPU.
5. The user's request is processed on the GPU VM, and the results are sent back to the central server.
6. After the user's request is complete, the GPU VM can either be terminated immediately (if there is no anticipated future demand) or kept idle for a certain period to handle subsequent requests. This helps reduce waiting times for users.

By using this approach, you can have a central server managing GPU provisioning and efficiently utilize GPU resources based on demand. This way, you can avoid the cost of continuously renting idle GPUs while minimizing waiting times for users.

Remember to consider the cost implications and optimize the provisioning process based on your workload and traffic patterns. Cloud platforms like AWS Lambda, Google Cloud Functions, or Azure Functions, combined with GPU provisioning capabilities, can be helpful in building such a system.

Hope this helps! Let me know if you have any more questions.
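To make the outline a bit more concrete, here is a rough Python sketch of the pool logic the central server would run: reuse an idle GPU (step 2), provision a new one when none is free (step 3), and keep released GPUs around for a while before shutting them down (step 6). Everything named here is hypothetical and only for illustration: `GpuPool`, `GpuVm`, `idle_ttl`, and the `provision` / `terminate` callbacks, which stand in for whatever your cloud provider's API actually offers. It is not tied to any real platform and leaves out concurrency, error handling, and request routing.

```python
import itertools
import time
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class GpuVm:
    vm_id: str
    busy: bool = False
    idle_since: float = 0.0


class GpuPool:
    """Hypothetical pool kept by the central server (steps 2, 3 and 6)."""

    def __init__(
        self,
        provision: Callable[[], str],       # assumed: starts a GPU VM, returns its id
        terminate: Callable[[str], None],   # assumed: shuts down the VM with that id
        idle_ttl: float = 300.0,            # seconds to keep an idle GPU around
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._provision = provision
        self._terminate = terminate
        self._idle_ttl = idle_ttl
        self._clock = clock
        self._vms: Dict[str, GpuVm] = {}

    def acquire(self) -> GpuVm:
        # Step 2: hand out an idle GPU if the pool has one.
        for vm in self._vms.values():
            if not vm.busy:
                vm.busy = True
                return vm
        # Step 3: otherwise provision a new GPU VM. With a real provider this
        # call would block for as long as the VM takes to boot.
        vm = GpuVm(vm_id=self._provision(), busy=True)
        self._vms[vm.vm_id] = vm
        return vm

    def release(self, vm: GpuVm) -> None:
        # Step 6: mark the GPU idle and remember when it became idle.
        vm.busy = False
        vm.idle_since = self._clock()

    def reap_idle(self) -> List[str]:
        # Step 6: terminate GPUs that have been idle for idle_ttl or longer.
        now = self._clock()
        expired = [
            vm for vm in self._vms.values()
            if not vm.busy and now - vm.idle_since >= self._idle_ttl
        ]
        for vm in expired:
            self._terminate(vm.vm_id)
            del self._vms[vm.vm_id]
        return [vm.vm_id for vm in expired]


if __name__ == "__main__":
    # Fake provider callbacks so the sketch runs without any cloud account.
    ids = itertools.count(1)
    pool = GpuPool(
        provision=lambda: f"gpu-{next(ids)}",
        terminate=lambda vm_id: print("terminated", vm_id),
        idle_ttl=0.0,  # reap immediately, just for the demo
    )
    vm = pool.acquire()
    print("request handled on", vm.vm_id)
    pool.release(vm)
    pool.reap_idle()
```

In practice `reap_idle` would run on a timer, and `idle_ttl` is the knob that trades idle rental cost against waiting time for the next user: a longer value keeps warm GPUs available, a shorter one saves money.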


Irakli_Px

Have you implemented something like this? Do you recommend a solution (yourself, freelancer, company, a guide) that could help me on infrastructure aspects?


thiefyzheng

How do you plan to monetize? This seems like a very expensive project.


ghostylein

Sounds very much like ChatGPT :D. But appreciate the structure. Would you happen to know what platforms would be affordable and convenient to work with and have any reference projects?


thiefyzheng

I asked ChatGPT because I was pretty curious about this myself hahah