Agentic AI is beginning to reshape malware detection and broader security operations. These systems are being used not to replace humans, but to take on the lower-value tasks that have historically tied up analysts, from triaging alerts to reverse-engineering suspicious files.
MarkItDown is an open-source Python utility that converts diverse file formats into Markdown to prepare data for LLMs and RAG systems. It handles various file types, preserves document structure, and integrates with LLMs for tasks like image description.
Microsoft researchers introduce LongRoPE2, a method to extend large language model context windows to 128K tokens while maintaining over 97% short-context accuracy, addressing key limitations in positional embeddings.
Microsoft has open-sourced MarkItDown, a tool that converts various file types into Markdown for easier integration, collaboration, and accessibility. It supports PDFs, PowerPoint presentations, Word documents, Excel spreadsheets, images, audio, HTML, text-based formats, and ZIP files, making it a versatile utility for users across different domains.
Microsoft has released the OmniParser model on Hugging Face. It is a vision-based tool that parses UI screenshots into structured elements, supporting GUI automation across platforms without relying on additional contextual data.
OpenRecall is open-source software that aims to be a privacy-focused alternative to Microsoft's Recall feature. It captures the user's digital history, processes text and images using OCR, and lets users find specific information by searching for relevant keywords. Currently, it stores data locally but does not encrypt it. It is available for Windows, macOS, and Linux.
This article provides a step-by-step guide on fine-tuning the Florence-2 model for object detection tasks, including loading the pre-trained model, fine-tuning with a custom dataset, and evaluating the model's performance.
Microsoft has deployed GPT-4, a large language model, in an isolated, air-gapped Azure Government Top Secret cloud for use by the Department of Defense. Once accredited, Pentagon officials will be able to use the technology in a secure environment. The tool is expected to help DOD officials deal with vast amounts of data and simplify information sorting. Microsoft is a major investor in OpenAI, the maker of GPT-4 and the popular ChatGPT.
Stay informed about the latest artificial intelligence (AI) terminology with this comprehensive glossary. From algorithm and AI ethics to generative AI and overfitting, learn the essential AI terms that will help you sound smart over drinks or impress in a job interview.
Microsoft revealed a new AI tool called Infra Copilot, which uses its existing GitHub Copilot to create infrastructure code.
Infra Copilot is designed to understand the context of infrastructure tasks and generate appropriate code suggestions based on natural language prompts.
The tool can streamline the coding process, enabling professionals to focus on higher-level tasks.
It also provides standardized code snippets for consistency across different environments.
Infra Copilot is available now to programmers with a recent Visual Studio Code version and a GitHub Copilot license.
Microsoft has also launched GitHub Copilot Enterprise, which uses data from a company's own code repositories to generate code and answer questions and is priced at $39 per month per user.