Search results for "EVAL"

07:22

The next version of the Yuntian Tianshu model will be benchmarked against GPT4.0 to further improve multimodal capabilities

Yuntian Lifei recently said in an institutional survey that the company's self-developed 100-billion-level large model - Yuntiantianshu large model has completed 2 version updates, and its comprehensive capabilities have been further improved, reaching the advanced level in the industry in general question answering, language understanding, mathematical reasoning, text generation, role-playing, etc.; in the C-Eval Chinese large model list in early September this year, the Yuntiantianshu large model ranked first on the list; the next version of the Yuntiantianshu large model will benchmark against GPT4.0 to further improve multimodal capabilities.

More

1

03:31

Models are “new every day”: SenseTime’s “SenseChat 2.0” comprehensive performance on multiple evaluation benchmarks exceeds that of ChatGPT

SenseTime recently announced the results of its self-developed Chinese language model "SenseChat 2.0" on three authoritative large language model evaluation benchmarks: MMLU, AGIEval, and C-Eval. According to the evaluation results, "Discuss SenseChat 2.0" outperformed ChatGPT in the three test sets, achieving an important breakthrough in the research of large language models in my country.

More

Load More

Hot Tags

Hot Topics

Crypto Calendar

Legacy Mainnet Shutdown

Neo has issued an official reminder that the Neo Legacy MainNet will be shut down on October 31. Users are urged to complete their asset migration before the deadline to avoid the risk of losing funds. The Legacy network, originally launched as AntShares MainNet in 2016, will be fully decommissioned, marking the end of its operational phase within the Neo ecosystem.

Seattle AI Week in Seattle

Arcblock plans to unveil a new partnership during Seattle AI Week on October 27th-31st. The conference is expected to attract more than 3,500 attendees and lists Coinbase, Accenture and other companies as sponsors.

Flow launches Forte Hacks, a virtual hackathon offering over $250,000 in prizes and perks, starting on October 1-31. The event aims to explore the full potential of the Flow ecosystem. Forte is now live on the Flow testnet, allowing developers to get an early start on their projects before the hackathon begins.

Cosmoverse in Split

Cosmos will host Cosmoverse 2025 in Split, Croatia, on October 30 – November 1, bringing together blockchain developers, ecosystem contributors, and policy experts for three days of panels, workshops, and networking.

Ripple Swell 2025 in New York

Ripple announced that its flagship event, Ripple Swell, will return to New York on November 3rd-5th.