Please drop us a message or reach out to connect@arfniia.com to request the free trial access and get started with launching the AMI.
We recommend using the 4th generation of Intel Xeon CPU instances, such as m7i.2xlarge or higher, to ensure optimal efficiency. Arfniia Router is specifically optimized for these architectures, offering up to 2x better performance per routing step with default configurations, compared to 3rd generation instances.
Configure IAM
Arfniia Router exclusively uses IAM roles instead of access keys for enhanced security. After launching the Arfniia Router EC2 instance, please make sure a role is attached.
Steps to configure IAM:
Grant the EC2 role self-assuming permission.
Grant the EC2 role permission to invoke Amazon Bedrock models.
Create a Router
Send a POST request to /v1/routers endpoint, e.g. creating a router with the following configs:
Choose between 2 base models, Llama 3.1 405B or Claude 3.5 Sonnet
Leverage amazon.titan-embed-text-v2:0 as part of prompt understanding
Maximize the feedback value, which will be in range [0, 1]
Calculate the reward 100% on feedback, ignoring cost factors
Apply a cosine similarity threshold of 0.95 as part of the sampling strategy
At each routing step, train for 5 epochs with 16 samples
Ready!
Send prompts to the OpenAI compatible endpoint, and Arfniia Router will automatically learn and serve the router using a Reinforcement Learning algorithm.