QCRI/AZERG-MixTask-Mistral

This model is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.3, specialized for Cyber Threat Intelligence (CTI) tasks. It was trained on the AZERG Dataset using a mixture of all four tasks required for STIX data generation:

  • T1: Entity Detection
  • T2: Entity Type Identification
  • T3: Related Pair Detection
  • T4: Relationship Type Identification

This is the most versatile model in the AZERG collection, capable of handling all STIX extraction sub-tasks.

Intended Use

This model is intended to be used within the AZERG framework to extract STIX entities and relationships from security reports. Each task has its own prompt; refer to the framework for the exact wording.
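
For quick experiments outside the framework, the checkpoint can presumably be loaded like any other causal language model on the Hub. The following is a minimal sketch, assuming the `transformers`, `torch`, and `accelerate` packages are installed and that the prompt is passed as plain text. Whether the framework additionally wraps the prompt in the Mistral chat template is not stated here, so treat that choice, the `generate` helper name, and the generation settings as assumptions.

```python
MODEL_ID = "QCRI/AZERG-MixTask-Mistral"


def generate(prompt: str, max_new_tokens: int = 256) -> str:
    """Return the model's completion for an already-formatted prompt.

    Hypothetical helper for illustration; it is not part of the AZERG framework.
    """
    # Imported lazily so this module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # the published weights are BF16
        device_map="auto",           # needs the `accelerate` package
    )

    # Assumption: the prompt is fed as raw text, without a chat template.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding for reproducible extraction
        )

    # Drop the prompt tokens and decode only the newly generated ones.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Loading the model inside the function keeps the sketch short; a real pipeline would load it once and reuse it across calls.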

Example Prompt (Task 1: Entity Detection):

Instruction:
You are a helpful threat intelligence analyst. Your task is to extract all STIX entities mentioned in the input. To help you, here is a list of the possible STIX entity types.
STIX entity types:
- ATTACK_PATTERN: A type of TTP that describes ways that adversaries attempt to compromise targets. (e.g., T1051, T1548.001, etc.)
[...]

Answer in the following format: <entities>LIST OF IDENTIFIED ENTITIES SEPARATED BY PIPE |</entities>

Input:
- Text Passage: [INPUT TEXT]

Response:
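
The Task 1 prompt above asks for a pipe-separated list wrapped in `<entities>` tags, so building the prompt and reading the answer are both simple string operations. The sketch below shows one way to do each. The function names `build_t1_prompt` and `parse_entities` are hypothetical, the exact whitespace of the template is an assumption, and the entity-type list (elided as `[...]` above) must be supplied from the framework.

```python
import re

# Template mirroring the Task 1 example. The entity-type list is elided in the
# example ("[...]"), so the caller passes it in rather than this sketch guessing it.
T1_TEMPLATE = (
    "Instruction:\n"
    "You are a helpful threat intelligence analyst. Your task is to extract all "
    "STIX entities mentioned in the input. To help you, here is a list of the "
    "possible STIX entity types.\n"
    "STIX entity types:\n"
    "{entity_types}\n"
    "\n"
    "Answer in the following format: "
    "<entities>LIST OF IDENTIFIED ENTITIES SEPARATED BY PIPE |</entities>\n"
    "\n"
    "Input:\n"
    "- Text Passage: {passage}\n"
    "\n"
    "Response:\n"
)

_ENTITIES_RE = re.compile(r"<entities>(.*?)</entities>", re.DOTALL)


def build_t1_prompt(passage: str, entity_types: str) -> str:
    """Fill the Task 1 template with a text passage and the entity-type list."""
    return T1_TEMPLATE.format(entity_types=entity_types.strip(), passage=passage.strip())


def parse_entities(response: str) -> list[str]:
    """Extract the pipe-separated entities from the first <entities> block.

    Returns an empty list when the response contains no such block.
    """
    match = _ENTITIES_RE.search(response)
    if match is None:
        return []
    return [item.strip() for item in match.group(1).split("|") if item.strip()]


# Made-up response string, used only to illustrate the parsing step.
print(parse_entities("<entities>APT29 | Cobalt Strike | T1566.001</entities>"))
# → ['APT29', 'Cobalt Strike', 'T1566.001']
```

Returning an empty list for a malformed response lets the caller decide whether to retry or skip the passage.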

Citation

If you use this model, please cite our paper:

@article{lekssays2025azerg,
  title={From Text to Actionable Intelligence: Automating STIX Entity and Relationship Extraction},
  author={Lekssays, Ahmed and Sencar, Husrev Taha and Yu, Ting},
  journal={arXiv preprint arXiv:2507.16576},
  year={2025}
}

Model size: 7B params (Safetensors, tensor type BF16)