Three PhD positions in the Data Mining group, TU/e
Three fully funded 4-year PhD positions are available in the Data Mining group, at the Department of Computer Science, TU Eindhoven, the Netherlands
- de Rondom, Eindhoven, Noord-Brabant
- Tijdelijk contract / Tijdelijke opdracht
- Uren per week:
- 38 uur
The Data Mining group at Eindhoven University of Technology (TU/e) is searching for candidates for three fully-funded 4-years PhD positions. All three positions involve fundamental scientific research: the primary goal is to enrich the portfolio of available data mining techniques with new innovative methods. Each position involves a specific project partner in industry, ensuring that your research will have direct impact in the real world. Hence, these positions offer the best of both worlds: theoretical research with direct practical impact.
The scientific framework in which the positions are grounded is Exceptional Model Mining, a data mining framework that strives to find subgroups within a dataset that are interesting. Subgroups are only deemed interesting if they satisfy two conditions. On the one hand, they must be interpretable: we must be able to succinctly describe the definition of a subgroup, so that the knowledge that they represent becomes actionable. On the other hand, they must be exceptional: they must display some kind of behavior that sets them apart from overall behavior. The scientific challenges revolve around how to efficiently search for subgroups, and how to express exceptional behavior such that the subgroups we find are meaningful.
TU/e is a dynamic, research-intensive university in the heart of Europe. TU/e is consistently ranked within the top-100 positions in several world rankings for its research and quality of education. The Data Mining group consists of 5 professors and 17 PhD students and postdocs. The research results of the group appear in top-tier conferences and journals including but not limited to IJCAI, AAAI, ICDM, SDM, KDD, ECMLPKDD, DAMI, MACH, and Nature Communications. The group has close collaboration with several leading companies, organizations and hospitals. The advertised positions are in collaboration with the Dutch National Police (Position A), Philips Research and the ZGT hospital (Position B), and Rabobank (Position C).
All 3 successful candidates will be based in the Data Mining group, where are opportunities to collaborate with other PhDs and to co-supervise MSc students relevant to PhD their research. The successful candidates are expected to be willing to spend some of their time at the Dutch National Police, Philips Research, and Rabobank, correspondingly. The research will be concluded with a PhD thesis. You will be supervised by dr. Wouter Duivesteijn and prof. dr. Mykola Pechenizkiy.
Position A. Exceptional behavior profiling / EMM in operational field labs at Dutch National Police.
The outlet center in Roermond is a hot target for pickpockets. Using a plethora of sensor data, the Dutch National Police is trying to detect the pickpockets before they strike. Since criminals can reasonably be assumed to behave differently from the overall population, it stands to reason that Exceptional Model Mining will be beneficial here.
A specific research challenge for this position lies in the following. The police possesses data characterizing how cars navigate the road network. If this data consists of 50% commuter traffic and 50% tourist traffic, the overall behavior is an average of these two populations. Hence, no single car will display what EMM typically designates as "the overall behavior". But then, if we strive to find subgroups behaving exceptionally, this behavior must be exceptional compared to what, exactly? Both theoretical solutions to this problem, as well as solutions that involve police experts in the loop, are to be explored.
Position B. EDIC - Exceptional and Deep Intelligent Coach.
Our increased life expectancy goes hand in hand with an increased number of years living with chronic conditions. The EDIC projects strives to develop an intelligent digital coach for deploying a healthy lifestyle for people suffering from such conditions. A key component of this coach is informed and interpretable complex event detection and model adaptation on heterogeneous and noisy data streams capturing patients' medial condition, lifestyle, and behavior. Your task is to perform the research necessary for fulfilling this component, within the wider EDIC consortium; this involves researchers from both TU/e and the University of Twente, the ZGT hospital driving the use cases (diabetes and obesity), and the companies Philips, Holst, and RRD who deliver the technology and know-how to build the physical coaching system.
A specific research challenge for this position concerns extending Exceptional Model Mining beyond traditional flat-table data. Traditionally, one assumes that all data is given, stems from one homogeneous probability distribution, and all observations are independent. The heterogeneous and noisy data streams generated in this project satisfy none of those traditional assumptions. A large conceptual step must be made within EMM to enable handling such data: how can one characterize exceptional behavior, if one can never oversee all behavior, and if one must compare across dependent observations?
Position C. Explainable and accountable KYC Predictive Analytics at Rabobank.
Analyzing customer transaction data is a longstanding form of data mining at banks. However, a bank interacts with customers along multiple channels: semi-structured text documents such as contracts, completely free-form text documents such as helpdesk chat logs, and the very clearly structured transaction data can combine to deliver a more complete image of the wants and needs of a customer. Rabobank strives to know their customer better, and Exceptional Model Mining plays a key role therein.
A specific research challenge for this position lies in extracting more knowledge on how a complex black-box classifier works. European legislation compels any company using automated systems for making decisions substantially impacting a customer, to deliver insights to that customer on how the decision was made. With the rise of complex classification models in the era of deep learning, this puts the state of the art in classification at odds with the demands on a company under the law. With Exceptional Model Mining, combining inherent interpretability with complex exceptional behavior, we can deliver deeper insights in how the classifier arrived at the decision. This combines fulfilling a company requirement under European law, with fascinating deeper insight in classifier behavior for scientists.
We are looking for candidates that meet the following requirements:
- a solid background in Computer Science with specialization in data mining, machine learning, deep leaning;
- a strong interest in data science research with focus on local pattern mining, exceptional model mining or related techniques;
- data mining software development skills at least in one language, e.g. R, Python, Java;
- good communication skills in English, both in speaking and in writing;
- capability and willingness to work both independently and in a team of data scientists and interact with domain experts; being highly motivated, rigorous, and disciplined;
- being enthusiastic about working on the challenging use cases;
- experience in research and a publication record will be considered additional advantages.
- A full time temporary appointment for a period of 4 years, with an intermediate evaluation after 9 months;
- A gross salary of €2.325 per month in the first year increasing up to €2.972 in the fourth year;
- Tight collaboration of academia with industry with access to real data and domain expertise.
- Strong collaboration ties with several research groups in Europe and worldwide.
- Healthy travel funding for presenting your work at the leading conferences and going on visiting research.
- Support for your personal development and career planning including courses, summer schools, conference visits etc.;
- A broad package of fringe benefits, e.g. excellent technical infrastructure, child daycare and sports facilities, extra holiday allowance (8%, May), and end-of-year bonus (8.3%, December).
Additional informationMore information
- For more information about the projects and open positions contact either prof. dr. Mykola Pechenizkiy (chair Data Mining) or dr. Wouter Duivesteijn by e-mail: m.pechenizkiy[at]tue.nl, w.duivesteijn[at]tue.nl.
- For more information about the employment conditions contact Arianne Boekema (HR advisor), e-mail: pzwin[at]tue.nl or by telephone: +31 40 247 5812.
The application should consist of the following parts:
- Cover letter explaining your motivation and qualifications for the position to which you apply; if you want to be considered for multiple positions, provide a separate motivation for each of these positions. Please append a short (at most a single page in A4-format) description of how you would approach the specific research challenge outlined for the position to which you apply.
- Detailed Curriculum Vitae, including contacts of at least two referees.
- A transcript of your grades for the Bachelor and Master level courses.
- A copy or a link to your MSc thesis. If you have not completed it yet, please explain your current situation.
- List of publications (with links to download) and/or software developed (e.g. links to GitHub).
The selection process will start on September 2, 2019, and will continue until the positions get filled. The positions are fully funded and immediately available. The start date is flexible.
Please apply by using the 'Apply now' button on top of this page. Applications submitted by e-mail will not be considered. You can upload one file including all the required documents (max. 10 MB).
Middels onderstaande knop kun je direct solliciteren op deze vacature.Reageer op deze vacature