Saturday, July 13, 2024

Exploring the Evolution of Text Classification in Healthcare Discussions

 

Source paper: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7433297

In the ever-evolving landscape of Natural Language Processing (NLP), one of the most exciting developments has been the rise of Transformer models. These models have revolutionized the field much like Convolutional Neural Networks did for computer vision. Our recent study, "From Bag-of-Words to Transformers: A Comparative Study for Text Classification in Healthcare Discussions in Social Media", delves into this paradigm shift, focusing on how different text representation techniques can classify healthcare-related social media posts.

The crux of our research lies in comparing traditional methods like Bag-of-Words (BoW) with state-of-the-art models such as BERT, Mistral, and GPT-4. We aimed to tackle the inherent challenges posed by short, noisy texts in Italian, a language often underrepresented in NLP research.

We employed two primary datasets: leaflets of medical products to train word embedding models and Facebook posts from groups discussing various medical topics. This dual-dataset approach allowed us to test the effectiveness of different text representation techniques thoroughly.

Our findings revealed a clear winner in the form of the Mistral embedding model, which achieved a staggering balanced accuracy of 99.4%. This model's superior performance underscores its powerful semantic capabilities, even in a niche application involving the Italian language. Another standout performer was BERT, particularly when fine-tuned, reaching a balanced accuracy of 92%. These results highlight the significant leap in performance that modern Transformer models offer over traditional methods.

Interestingly, while the classic BERT architecture proved highly effective, our exploration of hybrid models combining BERT with Support Vector Machines (SVM) yielded mixed results. This indicates that while BERT’s contextual embeddings are robust, integrating them with other classifiers requires careful parameter optimization to unlock their full potential.

In contrast, traditional methods like BoW and TF-IDF struggled with the classification task, reaffirming the need for more sophisticated models in handling complex language data. However, word embedding techniques, particularly when paired with rigorous pre-processing, still present a viable alternative, especially in computationally constrained environments.

A particularly exciting aspect of our study was examining GPT-4's in-context learning capabilities. GPT-4 demonstrated impressive adaptability, providing not only high classification accuracy but also nuanced understanding and semantic explanations for its decisions. This ability to perform in-context learning opens new avenues for developing more intuitive and explainable AI systems.

Our research has significant implications for healthcare professionals and researchers. By leveraging advanced NLP models, we can gain deeper insights into patient experiences and public health trends, potentially transforming how we monitor and combat misinformation in medical discussions online.

In conclusion, our study reaffirms the transformative power of modern LLMs in text classification tasks, particularly in challenging contexts like healthcare-related social media posts. As we continue to explore and refine these models, we move closer to harnessing their full potential in real-world applications, ultimately enhancing our ability to process and understand the vast amounts of textual data generated every day.


 

Tuesday, July 9, 2024

The term "Computational Intelligence" in Japanese: 計算知能 (Keisan Chinou)


Recently, our team had an enriching experience at WCCI 2024 in Yokohama, Japan, where we delved into the fascinating world of Computational Intelligence. In honor of our time there, I want to share a deeper look at the Japanese term for "Computational Intelligence": 計算知能 (Keisan Chinou).

The phrase was written during the gala dinner, mimicking the Japanese ritual that sees writing as a profound meditative moment.

Explaining each kanji forming  "計算知能"

  1. 計 (Kei) - Plan, Measure

    • This kanji is often associated with planning, measuring, or computing. It represents the careful consideration and calculation required in computational tasks. The left part of the character, 言, relates to speech or words, indicating systematic communication or planning. The right part, 十, is the number ten, symbolizing completeness or thoroughness in planning.
  2. 算 (San) - Calculate, Reckon

    • This kanji signifies calculation and arithmetic. It’s an essential component in mathematical operations. The character consists of 竹 (bamboo) at the top, suggesting the counting rods used in ancient China, and 目 (eye) at the bottom, indicating observation or scrutiny.
  3. 知 (Chi) - Knowledge, Wisdom

    • Representing knowledge or wisdom, this kanji emphasizes the intelligence aspect. It is composed of 矢 (arrow) on the left, symbolizing directness or precision, and 口 (mouth) on the right, denoting speech or communication, implying the conveyance of knowledge.
  4. 能 (Nou) - Ability, Capability

    • This kanji represents ability or capability, crucial to the concept of intelligence. It combines 月 (moon) on the left, which can signify time or cycles, and 匕 (spoon) on the right, representing the ability to nourish or support, metaphorically relating to capabilities and talents.

When combined, 計算知能 (Keisan Chinou) beautifully encapsulates the essence of Computational Intelligence:

  • 計 (Kei) and 算 (San) together emphasize the methodical and precise nature of computation.
  • 知 (Chi) and 能 (Nou) highlight the knowledge and capabilities that form the foundation of intelligence.

Stay tuned for more insights and experiences from our journey in the world of Computational Intelligence!

The CIPARLABS team at the World Congress on Computational Intellgience 2024 in Yokohama, Japan

The CIPARLABS team has just returned from WCCI 2024 — the world's largest conference on Computational Intelligence, Artificial Neural Networks, and Fuzzy Systems — held in Yokohama, Japan, from June 29 to July 5.

It was an exciting group experience that combined moments of learning and reflection with convivial moments where everyone could express themselves beyond academic performance.

The conference was organized at a large convention center, the Pacific Conference Center in Yokohama. The organization was lacking both in the paper review process and in logistics. Despite paying a high participation fee, no lunch was provided. The coffee break was meager and of insufficient duration for networking needs, especially in the hall where the poster session was held. Some special sessions were also organized in a haphazard manner. We expected greater participation from sponsors and companies. Apart from the poor logistical and organizational aspects, numerous interesting works were presented. The poster session featured some truly noteworthy works.


Our PhD students Sabereh Taghdisi Rastkar and Danial Zendehdel presented two papers at IJCNN (the sub-conference on Artificial Neural Networks) on optimization and prediction problems in the field of smart energy systems, specifically within Renewable Energy Communities. Our CIPARLABS research group is focusing its efforts on this topic. In particular, we are working on the design of an optimized Energy Management System with Computational Intelligence algorithms, complemented by energy variable prediction algorithms based on Deep Learning techniques. The design is conceived to fully comply with the technical requirements of the major European countries developing Renewable Energy Communities according to the European Green Deal and NextGenerationEU funds.


The gala dinner was plentiful, and we had the opportunity to experience Japanese culture both from a culinary perspective and in customs, as is usual at conferences of this kind. The ceremony was thrilling, featuring actors in traditional Japanese costumes writing the words "Computational Intelligence" on a large canvas using Japanese ideograms. It is well known that in Japan, writing is a ritual in itself, a personal and artistic performance worth appreciating.

Japan proved to be a crossroads of cultures as distant as the Western and Eastern ones. We visited Tokyo, a commercial city hosting Buddhist and Shinto temples. Tokyo showcases the contradictions of a country that has experienced numerous influences from China, Korea, and other Asian countries, as well as from the United States. The Japanese are distinguished by their kindness and the rituals related to gratitude, so different from us Westerners. The Japanese are both very serious and very self-ironic, a reflection of the animated characters and manga whose main figures are featured everywhere.

In Yokohama, we had the chance to appreciate the port and waterfront, one of the largest China Towns in Japan, and the nightlife area, one of the oldest in the city.

In Tokyo, among many places, we saw crowded districts like Shibuya and visited the old quarter full of small venues for nightlife. We also visited the Sensō-ji (金龍山浅草寺, Kinryū-zan Sensō-ji), a Buddhist temple complex located in the Asakusa district.



With the limited time available, we also visited Karakura, another area rich in Japanese gardens and Shinto and Buddhist temples.

 

  
  

In conclusion, going to Japan and participating in such an event was well worth it. We hope to repeat such experiences in the future.

See you at IJCNN 2025, which will be held in our home country, Rome, at the end of June 2025.












A summary of the European AI Act

    Image source The advent of generative AI marks a profound " cognitive revolution ", transforming how we interact with technol...