arbisoft brand logo
arbisoft brand logo

A Technology Partnership That Goes Beyond Code

  • company logo

    “Arbisoft is an integral part of our team and we probably wouldn't be here today without them. Some of their team has worked with us for 5-8 years and we've built a trusted business relationship. We share successes together.”

    Jake Peters profile picture

    Jake Peters/CEO & Co-Founder, PayPerks

  • company logo

    “They delivered a high-quality product and their customer service was excellent. We’ve had other teams approach us, asking to use it for their own projects”.

    Alice Danon profile picture

    Alice Danon/Project Coordinator, World Bank

1000+Tech Experts

550+Projects Completed

50+Tech Stacks

100+Tech Partnerships

4Global Offices

4.9Clutch Rating

Trending Blogs

    81.8% NPS78% of our clients believe that Arbisoft is better than most other providers they have worked with.

    • Arbisoft is your one-stop shop when it comes to your eLearning needs. Our Ed-tech services are designed to improve the learning experience and simplify educational operations.

      Companies that we have worked with

      • MIT logo
      • edx logo
      • Philanthropy University logo
      • Ten Marks logo

      • company logo

        “Arbisoft has been a valued partner to edX since 2013. We work with their engineers day in and day out to advance the Open edX platform and support our learners across the world.”

        Ed Zarecor profile picture

        Ed Zarecor/Senior Director & Head of Engineering

    • Get cutting-edge travel tech solutions that cater to your users’ every need. We have been employing the latest technology to build custom travel solutions for our clients since 2007.

      Companies that we have worked with

      • Kayak logo
      • Travelliance logo
      • SastaTicket logo
      • Wanderu logo

      • company logo

        “Arbisoft has been my most trusted technology partner for now over 15 years. Arbisoft has very unique methods of recruiting and training, and the results demonstrate that. They have great teams, great positive attitudes and great communication.”

        Paul English profile picture

        Paul English/Co-Founder, KAYAK

    • As a long-time contributor to the healthcare industry, we have been at the forefront of developing custom healthcare technology solutions that have benefitted millions.

      Companies that we have worked with

      • eHuman logo
      • Reify Health logo

      • company logo

        I wanted to tell you how much I appreciate the work you and your team have been doing of all the overseas teams I've worked with, yours is the most communicative, most responsive and most talented.

        Matt Hasel profile picture

        Matt Hasel/Program Manager, eHuman

    • We take pride in meeting the most complex needs of our clients and developing stellar fintech solutions that deliver the greatest value in every aspect.

      Companies that we have worked with

      • Payperks logo
      • The World Bank logo
      • Lendaid logo

      • company logo

        “Arbisoft is an integral part of our team and we probably wouldn't be here today without them. Some of their team has worked with us for 5-8 years and we've built a trusted business relationship. We share successes together.”

        Jake Peters profile picture

        Jake Peters/CEO & Co-Founder, PayPerks

    • Unlock innovative solutions for your e-commerce business with Arbisoft’s seasoned workforce. Reach out to us with your needs and let’s get to work!

      Companies that we have worked with

      • HyperJar logo
      • Edited logo

      • company logo

        The development team at Arbisoft is very skilled and proactive. They communicate well, raise concerns when they think a development approach wont work and go out of their way to ensure client needs are met.

        Veronika Sonsev profile picture

        Veronika Sonsev/Co-Founder

    • Arbisoft is a holistic technology partner, adept at tailoring solutions that cater to business needs across industries. Partner with us to go from conception to completion!

      Companies that we have worked with

      • Indeed logo
      • Predict.io logo
      • Cerp logo
      • Wigo logo

      • company logo

        “The app has generated significant revenue and received industry awards, which is attributed to Arbisoft’s work. Team members are proactive, collaborative, and responsive”.

        Silvan Rath profile picture

        Silvan Rath/CEO, Predict.io

    • Software Development Outsourcing

      Building your software with our expert team.

    • Dedicated Teams

      Long term, integrated teams for your project success

    • IT Staff Augmentation

      Quick engagement to boost your team.

    • New Venture Partnership

      Collaborative launch for your business success.

    Discover More

    Hear From Our Clients

    • company logo

      “Arbisoft partnered with Travelliance (TVA) to develop Accounting, Reporting, & Operations solutions. We helped cut downtime to zero, providing 24/7 support, and making sure their database of 7 million users functions smoothly.”

      Dori Hotoran profile picture

      Dori Hotoran/Director Global Operations - Travelliance

    • company logo

      “I couldn’t be more pleased with the Arbisoft team. Their engineering product is top-notch, as is their client relations and account management. From the beginning, they felt like members of our own team—true partners rather than vendors.”

      Diemand-Yauman profile picture

      Diemand-Yauman/CEO, Philanthropy University

    • company logo

      Arbisoft was an invaluable partner in developing TripScanner, as they served as my outsourced website and software development team. Arbisoft did an incredible job, building TripScanner end-to-end, and completing the project on time and within budget at a fraction of the cost of a US-based developer.

      Ethan Laub profile picture

      Ethan Laub/Founder and CEO

    Contact Us
    contact

    Anthropic - Introducing New Computer Capabilities with Claude 3.5 Sonnet and Claude 3.5 Haiku

    October 24, 2024
    https://d1foa0aaimjyw4.cloudfront.net/Newsletter_Cover_b98d0226b3.jpg

    Anthropic has introduced two major upgrades to its AI lineup; Claude 3.5 Sonnet and Claude 3.5 Haiku. Alongside these advancements, a new computer use feature has been launched in a public beta. These developments push the boundaries of automation, coding, and computer navigation, bringing new possibilities for developers and businesses alike. 

     

    Claude 3.5 Sonnet: Enhancing Software Engineering

    The Claude 3.5 Sonnet offers significant upgrades over its previous version, with enhanced abilities in coding and automation. This model shines in agentic coding tasks, improving its performance on benchmarks like SWE-bench Verified, moving from 33.4% to 49%, outperforming publicly available models, including OpenAI’s o1-preview. It also scored higher on the TAU bench, used to assess tool-based problem-solving: 

     

    • Retail domain: From 62.6% to 69.2%
    • Airline domain: From 36% to 46%

     

    These gains come with no added cost or latency, making Claude 3.5 Sonnet an ideal solution for complex, multi-step development tasks. Companies such as GitLab have reported up to 10% better reasoning on DevSecOps tasks. The Browser Company also found the model to be exceptional for automating web-based workflows. 

     

    This model has been tested rigorously in partnership with the US and UK AI Safety Institutes to ensure safe deployment. Its compliance with the ASL-2 Standard, part of Anthropic’s Responsible Scaling Policy, confirms that it meets safety benchmarks required for broader use.

     

    Screenshot 2024-10-24 at 2.31.52 PM.png

    Image Source: Anthropic

     

    Claude 3.5 Haiku: Affordable, Fast, and Capable AI

    The new Claude 3.5 Haiku model is designed for speed and cost-efficiency while matching the performance of Claude 3 Opus, which is the Anthropic’s largest previous model, across many evaluations. This model demonstrates excellent results in low-latency tasks, making it suitable for real-time applications like user-facing products and data-intensive tasks.

     

    Claude 3.5 Haiku scores 40.6% on SWE-bench Verified, outperforming earlier Claude models and even GPT-4o in some areas. It provides accurate tool usage and improved instruction-following capabilities, making it effective for generating personalized experiences from large datasets, such as purchase history, pricing records, or inventory data.

     

    This model will be available later in October, via Anthropic’s API, Amazon Bedrock, and Google Cloud Vertex AI. Initially, it will support text-only tasks, with image input functionality expected soon.  

     

    AI-Driven Computer Use in Public Beta

    One of the most exciting features Anthropic has introduced is Claude’s ability to use computers. Now in public beta, developers can use Claude to perform tasks just like a human, such as navigating screens, typing, clicking, and more. This feature allows the model to automate repetitive processes, conduct open-ended research, and even test software across multiple platforms. 



    Early adopters like Replit are already using this capability for automating complex UI navigation tasks, helping their Replit Agent product evaluate applications as they are developed. 

     

    In tests conducted by OSWorld, Claude 3.5 Sonner scored 22% when given more time to complete a task, outperforming other AI models that scored just 7.8%. Even so, the feature is still experimental and has some limitations. Tasks that require scrolling, zooming, or dragging can be challenging for the AI to perform smoothly. Developers are advised to start with low-risk projects to explore its potential. Anthropic promises ongoing improvements to this feature based on the feedback. 

     

    Ensuring Safe Deployment

    To address concerns around security risks, such as spam, fraud, or misinformation, Anthropic has developed new classifiers to monitor and prevent misuse of the computer use feature. This proactive approach helps ensure the responsible deployment of AI-driven automation. 

     

    Dataset and Training Details of Claude Models

    According to Google Cloud, all Claude models are trained through several techniques:

    • Unsupervised learning (learning from patterns in raw data)
    • Reinforcement Learning with Human Feedback (RLHF) (improving with feedback from people)
    • Constitutional AI (a process involving both supervised learning and reinforcement learning).

     

    Training Infrastructure

    Claude 3.5 Sonnet v2 is trained using cloud services provided by Amazon Web Services (AWS) and Google Cloud Platform (GCP). The main frameworks used for development include PyTorch, JAX, and Triton.

     

    Sources of Training Data

    Claude models use a mix of data that includes:

    1. Public internet information that was collected up to August 2023, with Claude 3.5 Sonnet v2’s training ending in April 2024.
    2. Non-public data from third parties, which includes content created or labeled by users, companies, or hired service providers.
    3. Internally generated data by Anthropic for refining the model.

     

    Data Cleaning and Filtering

    To ensure high-quality data, Anthropic applies methods like deduplication (removing repeated information) and classification to filter out irrelevant or low-quality data.

    Crawling Practices

    When gathering public data from websites, Anthropic follows responsible crawling practices:

    1. robots.txt files and other website signals are respected to ensure compliance with site owners' preferences.
    2. Anthropic does not access password-protected or sign-in pages or bypass CAPTCHAs to collect data.
    3. Their web-crawling system operates transparently, making it easy for site owners to identify visits and communicate their preferences to Anthropic.

     

    What’s Next

    The Claude Sonnet 3.5 is already available for use, and the Claude 3.5 Haiku will be released later in October. Both models along with the computer use feature, can be accessed via Anthropic’s API, Amazon Bedrock, and Google Cloud Vertex AI. As these innovations evolve, Anthropic invites developers to provide feedback and experiment with these tools in safe practical applications.

      Share on
      https://d1foa0aaimjyw4.cloudfront.net/image_7c49cbff76.png

      Amna Manzoor

      I have nearly five years of experience in content and digital marketing, and I am focusing on expanding my expertise in product management. I have experience working with a Silicon Valley SaaS company, and I’m currently at Arbisoft, where I’m excited to learn and grow in my professional journey.

      Related blogs

      0

      Let’s talk about your next project

      Contact us