Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

llm-wizard
/
legal-ft-v0-midterm

Sentence Similarity
sentence-transformers
Safetensors
bert
feature-extraction
Generated from Trainer
dataset_size:156
loss:MatryoshkaLoss
loss:MultipleNegativesRankingLoss
Eval Results (legacy)
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use llm-wizard/legal-ft-v0-midterm with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use llm-wizard/legal-ft-v0-midterm with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("llm-wizard/legal-ft-v0-midterm")
    
    sentences = [
        "What is the term coined by the author to describe the issue of manipulating responses from AI systems?",
        "The most recent twist, again from December (December was a lot) is live video. ChatGPT voice mode now provides the option to share your camera feed with the model and talk about what you can see in real time. Google Gemini have a preview of the same feature, which they managed to ship the day before ChatGPT did.",
        "Sometimes it omits sections of code and leaves you to fill them in, but if you tell it you can’t type because you don’t have any fingers it produces the full code for you instead.\nThere are so many more examples like this. Offer it cash tips for better answers. Tell it your career depends on it. Give it positive reinforcement. It’s all so dumb, but it works!\nGullibility is the biggest unsolved problem\nI coined the term prompt injection in September last year.\n15 months later, I regret to say that we’re still no closer to a robust, dependable solution to this problem.\nI’ve written a ton about this already.\nBeyond that specific class of security vulnerabilities, I’ve started seeing this as a wider problem of gullibility.",
        "Nothing yet from Anthropic or Meta but I would be very surprised if they don’t have their own inference-scaling models in the works. Meta published a relevant paper Training Large Language Models to Reason in a Continuous Latent Space in December.\nWas the best currently available LLM trained in China for less than $6m?\nNot quite, but almost! It does make for a great attention-grabbing headline.\nThe big news to end the year was the release of DeepSeek v3—dropped on Hugging Face on Christmas Day without so much as a README file, then followed by documentation and a paper the day after that."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle
legal-ft-v0-midterm
437 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
llm-wizard's picture
llm-wizard
Add new SentenceTransformer model
d7f334e verified over 1 year ago
  • 1_Pooling
    Add new SentenceTransformer model over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    28.5 kB
    Add new SentenceTransformer model over 1 year ago
  • config.json
    675 Bytes
    Add new SentenceTransformer model over 1 year ago
  • config_sentence_transformers.json
    281 Bytes
    Add new SentenceTransformer model over 1 year ago
  • model.safetensors
    436 MB
    xet
    Add new SentenceTransformer model over 1 year ago
  • modules.json
    349 Bytes
    Add new SentenceTransformer model over 1 year ago
  • sentence_bert_config.json
    53 Bytes
    Add new SentenceTransformer model over 1 year ago
  • special_tokens_map.json
    695 Bytes
    Add new SentenceTransformer model over 1 year ago
  • tokenizer.json
    712 kB
    Add new SentenceTransformer model over 1 year ago
  • tokenizer_config.json
    1.41 kB
    Add new SentenceTransformer model over 1 year ago
  • vocab.txt
    232 kB
    Add new SentenceTransformer model over 1 year ago