onnx
11 Topics

Getting Started Using Phi-3-mini-4k-instruct-onnx for Text Generation with NLP Techniques
In this tutorial, we cover how to use the Phi-3 mini models for text generation with NLP techniques. Whether you're a beginner or an experienced AI developer, you'll learn how to download and run these powerful models on your own computer. From setting up the Python environment to generating responses with the generate() API, we provide clear instructions and code examples throughout. So, let's get started and see what the Phi-3 mini models can do!
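The generation API the tutorial describes lives in the onnxruntime-genai package. A minimal sketch, assuming a recent onnxruntime-genai release (older releases used a slightly different loop) and a hypothetical local model directory:

```python
import onnxruntime_genai as og

model_dir = "phi3-mini-4k-instruct-onnx"   # assumed local path to the ONNX files
model = og.Model(model_dir)
tokenizer = og.Tokenizer(model)
stream = tokenizer.create_stream()

# Phi-3 chat template wrapped around the user question
prompt = "<|user|>\nWhat is ONNX Runtime?<|end|>\n<|assistant|>\n"
input_tokens = tokenizer.encode(prompt)

params = og.GeneratorParams(model)
params.set_search_options(max_length=256, temperature=0.7)

generator = og.Generator(model, params)
generator.append_tokens(input_tokens)

# Token-by-token loop, streaming decoded text as it is produced
while not generator.is_done():
    generator.generate_next_token()
    print(stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
print()
```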
Use WebGPU + ONNX Runtime Web + Transformer.js to build RAG applications by Phi-3-mini

Learn how to harness the power of WebGPU, ONNX Runtime Web, and Transformer.js to build Retrieval-Augmented Generation (RAG) applications with Phi-3-mini. Dive into this technical guide and build intelligent applications that combine retrieval and generation seamlessly.
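The stack in the post runs in the browser in JavaScript, but the retrieve-then-generate pattern itself is compact. A language-agnostic Python sketch of that pattern, with a toy hashed embedding standing in for the real embedding model and purely illustrative documents:

```python
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    # Toy hashed bag-of-words vector; a real app would call a
    # sentence-embedding model here. Purely illustrative.
    v = np.zeros(dim)
    for tok in text.lower().split():
        v[hash(tok) % dim] += 1.0
    n = np.linalg.norm(v)
    return v / n if n else v

docs = [
    "ONNX Runtime Web runs ONNX models in the browser.",
    "WebGPU gives browser code access to the GPU.",
    "Phi-3-mini is a small language model from Microsoft.",
]
doc_vecs = np.stack([embed(d) for d in docs])

def retrieve(query: str, k: int = 2) -> list[str]:
    scores = doc_vecs @ embed(query)   # cosine similarity (unit vectors)
    return [docs[i] for i in np.argsort(scores)[::-1][:k]]

query = "What is Phi-3-mini?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)   # this prompt would then be fed to the generator (Phi-3-mini)
```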
GPU compute within Windows Subsystem for Linux 2 supports AI and ML workloads

Adding GPU compute support to WSL has been our #1 most requested feature since the first release. Over the last few years, the WSL, Virtualization, DirectX, Windows Driver, and Windows AI teams, along with our silicon partners, have been working hard to deliver this capability.
Running Phi-3-vision via ONNX on Jetson Platform

Unlock the potential of NVIDIA's Jetson platform by running the Phi-3-vision model in ONNX format. Dive into the process of compiling onnxruntime-genai, setting up the environment, and executing high-performance inference on low-power devices like the Jetson Orin Nano. Discover how to use quantized models efficiently, enabling robust image-and-text dialogue tasks while keeping the GPU workload optimized. Whether you're working with FP16 or INT4 models, this guide walks you through each step, ensuring you harness the full capabilities of edge AI on Jetson.
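For orientation, this is the general shape of multimodal inference with onnxruntime-genai, following the pattern of its Phi-3-vision examples; the model path, image file, and search options here are assumptions, and the exact loop depends on the package version:

```python
import onnxruntime_genai as og

model = og.Model("phi3-vision-onnx")               # assumed local model path
processor = model.create_multimodal_processor()
stream = processor.create_stream()

image = og.Images.open("street.jpg")               # hypothetical image file
prompt = "<|user|>\n<|image_1|>\nDescribe this image.<|end|>\n<|assistant|>\n"
inputs = processor(prompt, images=image)

params = og.GeneratorParams(model)
params.set_inputs(inputs)
params.set_search_options(max_length=1024)

# Same streaming loop as the text-only case, now conditioned on the image
generator = og.Generator(model, params)
while not generator.is_done():
    generator.generate_next_token()
    print(stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
print()
```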
Journey Series for Generative AI Application Architecture - Model references and evaluation models

In the previous article, we integrated the entire SLMOps process through Microsoft Olive. The development team can configure everything from data preparation and fine-tuning to format conversion and deployment through an Olive config. In this article, I want to talk about model references and evaluation.
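As a rough illustration of what such a config can express, here is a minimal sketch using Olive's Python entry point; the pass names and schema vary by Olive version, so treat the entire config as hypothetical and verify it against the Olive documentation:

```python
from olive.workflows import run as olive_run

# Hypothetical minimal Olive workflow: convert a Hugging Face model to
# ONNX. Fine-tuning and quantization passes would be chained under
# "passes" in the same way; keys shown track recent Olive releases.
config = {
    "input_model": {
        "type": "HfModel",
        "model_path": "microsoft/Phi-3-mini-4k-instruct",
    },
    "passes": {
        "convert": {"type": "OnnxConversion", "target_opset": 17},
    },
    "output_dir": "models/phi3-onnx",
}

olive_run(config)
```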
ONNX and NPU Acceleration for Speech on ARM

This project, from students at University College London, explores the benefits of ONNX and NPU accelerators for speeding up inference of Whisper models, and develops a local Whisper model that leverages these techniques on ARM-based systems.
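To give a feel for how an NPU is targeted from ONNX Runtime, a hedged sketch using the QNN execution provider (one of several NPU backends on ARM); the model file, input shape, and backend library are assumptions to adapt for your own export:

```python
import numpy as np
import onnxruntime as ort

print(ort.get_available_providers())   # confirm the NPU EP is in this build

# Prefer the NPU via the QNN execution provider, fall back to CPU.
# "whisper_encoder.onnx" and the backend library path are hypothetical.
session = ort.InferenceSession(
    "whisper_encoder.onnx",
    providers=["QNNExecutionProvider", "CPUExecutionProvider"],
    provider_options=[{"backend_path": "libQnnHtp.so"}, {}],
)

# Whisper encoders take an 80-bin log-mel spectrogram over 3000 frames
mel = np.zeros((1, 80, 3000), dtype=np.float32)
hidden_states = session.run(None, {session.get_inputs()[0].name: mel})[0]
print(hidden_states.shape)
```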