Category: graduates tracking system

New MSc thesis concerning Big Data at UBB

Today at the University of Bielsko-Biala, one of our graduates defended an MSc thesis.

The thesis, ‘Sentiment and stock market analysis of listed companies using Big Data tools’, introduces the topic of Big Data together with sentiment analysis. Big Data plays an increasingly important role in the modern world. With the help of huge data sets, many companies are able to predict customer demand and create personalised offers, or even react in advance to possible production failures. However, working with Big Data brings many challenges. Processing, storing and analysing huge sets of information requires specialised technologies, as ordinary tools cannot handle such an amount of data.
One extensive source of Big Data is social networks. Users of such applications leave a lot of information about themselves, which is kept in extensive data stores and which, when correctly analysed, helps to present relevant offers, advertisements and information personalised for the individual user.
In this study, the impact of Twitter users’ online statements on the share prices of selected companies was analysed. The research was carried out over a period of one month, and conclusions were presented for each company analysed.
However, the results do not indicate a direct connection between the sentiment of users’ statements and the rise or fall in the share prices of the companies in question. Stock market asset valuation is a complex process, so sentiment should be analysed together with other factors, such as company valuations or the current global economic situation, which sets directions for future work.
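
As an illustration of the kind of comparison described above, the following is a minimal sketch of how daily sentiment could be set against daily price changes. It is not the method used in the thesis, which the summary does not specify. The function names and the choice of the Pearson coefficient are assumptions made for this example only.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series shows no co-movement
    return cov / sqrt(var_x * var_y)


def daily_returns(closes):
    """Relative day-over-day change of a list of closing prices."""
    return [(b - a) / a for a, b in zip(closes, closes[1:])]


def sentiment_vs_returns(daily_sentiment, closes):
    """Correlate mean sentiment per day with the return of the same day.

    Assumes daily_sentiment[i] belongs to the day of closes[i + 1],
    so it must be exactly one element shorter than closes.
    """
    return pearson(daily_sentiment, daily_returns(closes))
```

A coefficient close to zero over the observed month would be consistent with the conclusion above that no direct connection was found, although a single coefficient cannot rule out delayed or non-linear effects.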

contact: lukasz1081@wp.pl

A project to build a data “pipeline” with Big Data aspects in mind

This project was carried out at the University of Bielsko-Biala as part of a BSc thesis. The aim of the thesis was to survey the available technologies for working with the large data sets referred to as Big Data, and to design a data pipeline based on specific assumptions. The pipeline covers obtaining, processing, analysing, applying and storing data. To design it, the author used Google Cloud Platform tools. Google Cloud Storage provided access to the data storage container, and the Google Dataproc service was used to create and configure a Hadoop and Spark cluster. In the example built for the thesis, the author showed how data obtained from the Twitter social network is transformed, extracting from it customers’ opinions about the surveyed company. The next stage of the work was the analysis of the obtained data. For this purpose the Google Data Studio platform was used to create charts, reports and statistics, enabling an in-depth analysis of the studied example. The obtained data was then compared with the information available on the market.
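
The stages named above (obtaining, processing, analysing and storing) can be pictured with a small sketch in plain Python. This is not the author's implementation, which ran on Hadoop and Spark in Google Cloud. The function names, the `text` field and the tiny word lists are all hypothetical and serve only to show how the stages chain together.

```python
import json
import re
from collections import Counter

# Hypothetical mini-lexicon; a real project would use a proper sentiment model.
POSITIVE = {"good", "great", "love", "excellent"}
NEGATIVE = {"bad", "poor", "hate", "terrible"}


def obtain(raw_lines):
    """Stage 1: parse raw JSON lines into records, skipping malformed ones."""
    for line in raw_lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue
        if isinstance(record, dict) and isinstance(record.get("text"), str):
            yield record


def process(records):
    """Stage 2: normalise each text into lowercase word tokens."""
    for record in records:
        tokens = re.findall(r"[a-z']+", record["text"].lower())
        yield {**record, "tokens": tokens}


def analyse(records):
    """Stage 3: label each record as positive, negative or neutral."""
    for record in records:
        score = sum(t in POSITIVE for t in record["tokens"])
        score -= sum(t in NEGATIVE for t in record["tokens"])
        label = "positive" if score > 0 else "negative" if score < 0 else "neutral"
        yield {**record, "sentiment": label}


def store(records):
    """Stage 4: aggregate labels (a stand-in for writing to real storage)."""
    return Counter(record["sentiment"] for record in records)


def run_pipeline(raw_lines):
    """Chain the four stages; generators keep only one record in memory at a time."""
    return store(analyse(process(obtain(raw_lines))))
```

Each stage consumes the output of the previous one, so a stage can be swapped out independently, for instance replacing the word lists with a trained classifier or replacing `store` with a write to a storage bucket.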

 

work supervisor: Marcin Bernaś (mbernas@ath.bielsko.pl)